ALLERGENIC PROTEINS AND PEPTIDES
FROM JAPANESE CEDAR POLLEN

Background of the Invention

Genetically predisposed individuals, who make up about 10% of the
population, become hypersensitized (allergic) to antigens from a variety of
environmental sources to which they are exposed. Those antigens that can induce
immediate and/or delayed types of hypersensitivity are known as allergens (King,
T.P., Adv. Immunol. 23: 77-105, (1976)). Anaphylaxis or atopy, which includes the
symptoms of hay fever, asthma, and hives, is one form of immediate allergy. It can
be caused by a variety of atopic allergens, such as products of grasses, trees,
weeds, animal dander, insects, food, drugs, and chemicals.

The antibodies involved in atopic allergy belong primarily to the IgE class of
immunoglobulins. IgE binds to mast cells and basophils. Upon combination of a
specific allergen with IgE bound to mast cells or basophils, the IgE may be
cross-linked on the cell surface, resulting in the physiological effects of
IgE-antigen interaction. These physiological effects include the release of, among
other substances, histamine, serotonin, heparin, a chemotactic factor for
eosinophilic leukocytes and/or the leukotrienes, C4, D4, and E4, which cause
prolonged constriction of bronchial smooth muscle cells (Hood, L.E. et al.
Immunology (2nd ed.), The Benjamin/Cumming Publishing Co., Inc. (1984)). These
released substances are the mediators which result in allergic symptoms caused by a
combination of IgE with a specific allergen. Through them, the effects of an
allergen are manifested. Such effects may be systemic or local in nature, depending
on the route by which the antigen entered the body and the pattern of deposition of
IgE on mast cells or basophils. Local manifestations generally occur on epithelial
surfaces at the location at which the allergen entered the body. Systemic effects
can include anaphylaxis (anaphylactic shock), which is the result of an
IgE-basophil response to circulating (intravascular) antigen.

Japanese cedar (Sugi; Cryptomeria japonica) pollinosis is one of the most
important allergic diseases in Japan. The number of patients suffering from this
disease is on the increase and in some areas, more than 10% of the population are
affected. Treatment of Japanese cedar pollinosis by administration of Japanese cedar
pollen extract to effect hyposensitization to the allergen has been attempted.
Hyposensitization using Japanese cedar pollen extract, however, has drawbacks in
that it can elicit anaphylaxis if high doses are used, whereas when low doses are used
to avoid anaphylaxis, treatment must be continued for several years to build up a
tolerance for the extract.

The major allergen from Japanese cedar pollen has been purified and
designated as Sugi basic protein (SBP) or Cry j I. This protein is reported to be a
basic protein with a molecular weight of 41-50 kDa and a pI of 8.8. There appear to
be multiple isoforms of the allergen, apparently due in part to differential
glycosylation (Yasueda et al. (1983) J. Allergy Clin. Immunol. 71: 77-86; and Taniai
et al. (1988) FEBS Letters 239: 329-332). The sequence of the first twenty amino
acids at the N-terminal end of Cry j I and a sixteen amino acid internal sequence
have been determined (Taniai, supra).

A second allergen has recently been isolated from the pollen of Cryptomeria
japonica (Japanese cedar) (Sakaguchi et al. (1990) Allergy 45: 309-312). This
allergen, designated Cry j II, has been reported to have a molecular weight of
approximately 37 kDa and 45 kDa when assayed on sodium dodecyl sulfate-polyacrylamide
gel electrophoresis (SDS-PAGE) under non-reducing and reducing
conditions, respectively (Sakaguchi et al., supra). Cry j II was found to have no
immunological cross-reactivity with Cry j I (Sakaguchi (1990) supra; Kawashima et
al. (1992) Int. Arch. Allergy Immunol. 98: 110-117). Most patients with Japanese
cedar pollinosis were found to have IgE antibodies to both Cry j I and Cry j II;
however, 29% of allergic patients had IgE that only reacted with Cry j I and 14% of
allergic patients had IgE that only reacted with Cry j II (Sakaguchi (1990) supra).
Isoelectric focusing of Cry j II indicated that this protein has a pI above 9.5, as
compared to pI 8.6-8.8 for Cry j I (Sakaguchi (1990) supra). Further, the reported
NH2-terminal sequence for Cry j II, NH2-AlaIleAsnIlePheAsnValGluLysTyr-COOH,
did not match that reported for Cry j I (Sakaguchi (1990) supra).

Despite the attention Japanese cedar pollinosis allergens have received,
definition or characterization of the allergens responsible for its adverse effects on
people is far from complete. Current desensitization therapy involves treatment with
pollen extract with its attendant risks of anaphylaxis if high doses of pollen extract
are administered, or long desensitization times when low doses of pollen extract are
administered.

Summary of the Invention 

The present invention provides nucleic acid sequences coding for the
Cryptomeria japonica major pollen allergen Cry j II and fragments thereof. The
present invention also provides purified Cry j II and at least one fragment thereof
produced in a host cell transformed with a nucleic acid sequence coding for Cry j II
or at least one fragment thereof, and fragments of Cry j II prepared synthetically.
As used herein, a fragment of the nucleic acid sequence coding for the entire amino
acid sequence of Cry j II refers to a nucleotide sequence having fewer bases than the
nucleotide sequence coding for the entire amino acid sequence of Cry j II and/or
mature Cry j II. Cry j II and fragments thereof are useful for diagnosing, treating,
and preventing Japanese cedar pollinosis. This invention is more particularly
described in the appended claims and is described in its preferred embodiments in
the following description.

Description of the Drawings 

Fig. 1a shows an SDS-PAGE (12%) analysis of Cry j II under non-reducing
conditions.

Fig. 1b shows an SDS-PAGE (12%) analysis of Cry j II under reducing
conditions.

Fig. 2 shows the results of mono S column chromatography of Cry j II eluted
with a step gradient of NaCl in 10 mM sodium acetate buffer, pH 5.0.

Fig. 3 shows an SDS-PAGE (12%) of purified subfractions of Cry j II
analyzed under reducing conditions.

Fig. 4 shows the nucleic acid sequence coding for Cry j II (SEQ ID NO: 1) and
the deduced amino acid sequence (SEQ ID NO: 2).

Fig. 5 shows the deduced amino acid sequence of Cry j II (SEQ ID NO: 2).

Fig. 6 shows the long form (SEQ ID NO: 4) and short form (SEQ ID NO: 5)
NH2-terminal amino acid sequences of Cry j II determined by protein sequence
analysis as discussed in Example 2 aligned with the ten amino acid sequence of Cry
j II (SEQ ID NO: 3) defined by Sakaguchi et al., supra (SEQ ID NO: 6).

Fig. 7 is a graphic representation of the results of a direct ELISA assay
showing the binding response of the monoclonal antibody 4B11 and seven patients'
(Batch 1) plasma IgE to purified Cry j I as the coating antigen.

Fig. 8 is a graphic representation of a direct ELISA assay showing the
binding response of the monoclonal antibody 4B11 and seven patients' (Batch 1)
plasma IgE to purified native Cry j II as the coating antigen.

Fig. 9 is a graphic representation of a direct ELISA assay showing the
binding response of the monoclonal antibody 4B11 and seven patients' (Batch 1)
plasma IgE to recombinant Cry j II (rCry j II) as the coating antigen.

Fig. 10 is a graphic representation of a direct ELISA assay showing the
binding response of eight patients' (Batch 2) plasma IgE to purified native Cry j I.

Fig. 11 is a graphic representation of a direct ELISA assay showing the
binding response of eight patients' (Batch 2) plasma IgE to purified native Cry j II.

Fig. 12 is a graphic representation of a direct ELISA assay showing the
binding response of eight patients' (Batch 2) plasma IgE to recombinant Cry j II.

Fig. 13 is a graphic representation of a direct ELISA assay showing the
binding response of eight patients' (Batch 3) plasma IgE to purified native Cry j I.

Fig. 14 is a graphic representation of a direct ELISA assay showing the
binding response of eight patients' (Batch 3) plasma IgE to purified native Cry j II.

Fig. 15 is a graphic representation of a direct ELISA assay showing the
binding response of eight patients' (Batch 3) plasma IgE to recombinant Cry j II.

Fig. 16 is a table which summarizes both the MAST scores performed on
patients' plasma samples (Batches 1-3) and the direct ELISA results shown in Figs. 7-15;
a positive response is indicated by a (+) sign and the number of positive
responses for each antigen is shown at the bottom of each column.

Detailed Description of the Invention

The present invention provides nucleic acid sequences coding for Cry j II, an
allergen found in Japanese cedar pollen. The nucleic acid sequence coding for Cry j
II shown in Fig. 4 (SEQ ID NO: 1) encodes a protein of 514 amino acids. The
deduced Cry j II amino acid sequence is shown in Figs. 4 and 5 (SEQ ID NO: 2).
Direct protein sequence analysis of native purified Cry j II resulted in two separate
overlapping NH2-terminal sequences, designated Long and Short, corresponding
respectively to amino acids 46 through 89 (SEQ ID NO: 4) and 51 through 89 (SEQ
ID NO: 5) of Figs. 4, 5 and 6. The ten amino acid sequence
NH2-AlaIleAsnIlePheAsnValGluLysTyr-COOH (SEQ ID NO: 6) previously defined by
Sakaguchi et al., supra for Cry j II corresponds to amino acids 55 through 64 of
Figs. 4 and 6. The full-length Cry j II sequence contains 20 cysteine residues and
three potential N-linked glycosylation sites with the consensus sequence of
Asn-Xxx-Ser/Thr. According to the program contained in PC Gene, IntelliGenetics
(Mountain View, CA) the proteins with the NH2-termini defined by the Long and Short
forms of Cry j II would contain 469 and 464 amino acids, respectively, and have
predicted molecular weights of 51.5 kDa (long) and 50.9 kDa (short). The amino acid
sequence representing the long form of Cry j II is encoded by the nucleotide
sequence extending from bases 177-1586 (SEQ ID NO: 7) as shown in Fig. 4, and
the amino acid sequence representing the short form of Cry j II is encoded by the
nucleotide sequence extending from 192-1586 (SEQ ID NO: 8) as shown in Fig. 4.
A host cell transformed with a vector containing the cDNA insert coding for full-
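
The base ranges and residue counts given above for the long and short forms can
be cross-checked with a few lines of arithmetic, and the Asn-Xxx-Ser/Thr consensus
can be scanned for in any one-letter amino acid string. The following Python sketch
is illustrative only: the function names are hypothetical, the test peptide is made
up, and the actual Cry j II sequence (SEQ ID NO: 2) is not reproduced here.

```python
import re


def coding_span(first_base, last_base):
    """Return (n_bases, n_codons) for an inclusive, 1-based base range."""
    n_bases = last_base - first_base + 1
    if n_bases % 3 != 0:
        raise ValueError("range is not a whole number of codons")
    return n_bases, n_bases // 3


def find_sequons(protein):
    """Return 1-based positions of Asn in each Asn-Xxx-Ser/Thr motif.

    Assumption: Xxx is allowed to be any residue, as the consensus is
    written in the text. A lookahead is used so that overlapping motifs
    are all reported.
    """
    return [m.start() + 1 for m in re.finditer(r"(?=N[A-Z][ST])", protein)]


if __name__ == "__main__":
    print(coding_span(177, 1586))      # long form range from the text
    print(coding_span(192, 1586))      # short form range from the text
    print(find_sequons("MNGSANNTTK"))  # made-up peptide, not Cry j II
```

Both ranges come out one codon longer than the stated 469 and 464 residues, which
would be consistent with each range including a terminal stop codon, although the
text does not say so explicitly.
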

Fragments of the nucleic acid sequence coding for fragments of Cry j II are
also within the scope of the invention. Fragments within the scope of the invention
include those coding for parts of Cry j II which induce an immune response in
mammals, preferably humans, such as stimulation of minimal amounts of IgE;
binding of IgE; eliciting the production of IgG and IgM antibodies; or the eliciting
of a T cell response such as proliferation and/or lymphokine secretion and/or the
induction of T cell anergy. The foregoing fragments of Cry j II are referred to herein
as antigenic fragments. Fragments within the scope of the invention also include
those capable of hybridizing with nucleic acid from other plant species for use in
screening protocols to detect allergens that are cross-reactive with Cry j II. As used
herein, a fragment of the nucleic acid sequence coding for Cry j II refers to a
nucleotide sequence having fewer bases than the nucleotide sequence coding for the
entire amino acid sequence of Cry j II and/or mature Cry j II. Generally, the nucleic
acid sequence coding for the fragment or fragments of Cry j II will be selected from
the bases coding for the mature protein; however, in some instances it may be
desirable to select all or a part of a fragment or fragments from the leader sequence
portion of the nucleic acid sequence of the invention. The nucleic acid sequence of
the invention may also contain linker sequences, modified restriction endonuclease
sites and other sequences useful for cloning, expression or purification of Cry j II or
fragments thereof.

A nucleic acid sequence coding for Cry j II may be obtained from
Cryptomeria japonica plants. Applicants have found that fresh pollen and staminate
cones are a good source of Cry j II mRNA. It may also be possible to obtain the
nucleic acid sequence coding for Cry j II from genomic DNA. Cryptomeria
japonica is a well-known species of cedar, and plant material may be obtained from
wild, cultivated, or ornamental plants. The nucleic acid sequence coding for Cry j II
may be obtained using the method disclosed herein or any other suitable techniques
for isolation and cloning of genes. The nucleic acid sequence of the invention may
be DNA or RNA.

The present invention provides expression vectors and host cells transformed
to express the nucleic acid sequences of the invention. Nucleic acid coding for Cry j
II, or at least one fragment thereof, may be expressed in bacterial cells such as E.
coli, insect cells (baculovirus), yeast, or mammalian cells such as Chinese hamster
ovary cells (CHO). Suitable expression vectors, promoters, enhancers, and other
expression control elements may be found in Sambrook et al., Molecular Cloning: A
Laboratory Manual, second edition, Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, New York (1989). Other suitable expression vectors, promoters,
enhancers, and other expression elements are known to those skilled in the art.
Expression in mammalian, yeast or insect cells leads to partial or complete
glycosylation of the recombinant material and formation of any inter- or intra-chain
disulfide bonds. Suitable vectors for expression in yeast include YEpSec1 (Baldari et
al. (1987) EMBO J. 6: 229-234); pMFa (Kurjan and Herskowitz (1982) Cell 30: 933-943);
JRY88 (Schultz et al. (1987) Gene 54: 113-123) and pYES2 (Invitrogen
Corporation, San Diego, CA). These vectors are freely available. Baculovirus and
mammalian expression systems are also available. For example, a baculovirus
system is commercially available (PharMingen, San Diego, CA) for expression in
insect cells while the pMSG vector is commercially available (Pharmacia,
Piscataway, NJ) for expression in mammalian cells.

For expression in E. coli, suitable expression vectors include, among others,
pTRC (Amann et al. (1988) Gene 69: 301-315); pGEX (Amrad Corp., Melbourne,
Australia); pMAL (N.E. Biolabs, Beverly, MA); pRIT5 (Pharmacia, Piscataway,
NJ); pET-11d (Novagen, Madison, WI; Jameel et al. (1990) J. Virol. 64: 3963-3966);
and pSEM (Knapp et al. (1990) BioTechniques 8: 280-281). The use of
pTRC and pET-11d, for example, will lead to the expression of unfused protein.
The use of pMAL, pRIT5, pSEM and pGEX will lead to the expression of allergen
fused to maltose E binding protein (pMAL), protein A (pRIT5), truncated
β-galactosidase (pSEM), or glutathione S-transferase (pGEX). When Cry j II, or a
fragment or fragments thereof, is expressed as a fusion protein, it is particularly
advantageous to introduce an enzymatic cleavage site at the fusion junction between
the carrier protein and Cry j II or fragment thereof. Cry j II or fragment thereof
may then be recovered from the fusion protein through enzymatic cleavage at the
enzymatic site and biochemical purification using conventional techniques for
purification of proteins and peptides. Suitable enzymatic cleavage sites include those
for blood clotting Factor Xa or thrombin, for which the appropriate enzymes and
protocols for cleavage are commercially available from, for example, Sigma Chemical
Company, St. Louis, MO and N.E. Biolabs, Beverly, MA. The different vectors
also have different promoter regions allowing constitutive or inducible expression
with, for example, IPTG induction (pTRC, Amann et al. (1988) supra; pET-11d,
Novagen, Madison, WI) or temperature induction (pRIT5, Pharmacia, Piscataway,
NJ). It may also be appropriate to express recombinant Cry j II in different E. coli
hosts that have an altered capacity to degrade recombinantly expressed proteins (e.g.
U.S. patent 4,758,512). Alternatively, it may be advantageous to alter the nucleic
acid sequence to use codons preferentially utilized by E. coli, where such nucleic
acid alteration would not affect the amino acid sequence of the expressed protein.
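
The codon alteration mentioned above can be pictured as a synonymous substitution:
each codon is swapped for a preferred codon encoding the same amino acid, and the
translation is checked to be unchanged. The Python sketch below is a minimal
illustration under stated assumptions. The `PREFERRED` table is a small hypothetical
example covering only three amino acids, not a measured E. coli codon usage table,
and the input sequence is made up.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Hypothetical, partial preference table used only for illustration.
PREFERRED = {"L": "CTG", "R": "CGT", "P": "CCG"}


def translate(dna):
    """Translate a DNA string whose length is a multiple of three."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def recode(dna, preferred=PREFERRED):
    """Swap codons for preferred synonymous codons, keeping the protein."""
    protein = translate(dna)
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    recoded = "".join(preferred.get(aa, codon)
                      for aa, codon in zip(protein, codons))
    # Guard: the encoded amino acid sequence must not change.
    if translate(recoded) != protein:
        raise ValueError("preference table changed the encoded protein")
    return recoded


if __name__ == "__main__":
    original = "ATGCTAAGACCATAA"  # made-up example, not Cry j II
    print(recode(original), translate(original))
```

The final translation check is the important part of the design: it rejects any
preference table entry that would alter the expressed amino acid sequence.
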

Host cells can be transformed to express the nucleic acid sequences of the
invention using conventional techniques such as calcium phosphate or calcium
chloride co-precipitation, DEAE-dextran-mediated transfection, or electroporation.
Suitable methods for transforming the host cells may be found in Sambrook et al.,
supra, and other laboratory textbooks. The nucleic acid sequences of the invention
may also be synthesized using standard techniques.

The present invention also provides a method of producing purified Japanese
cedar pollen allergen Cry j II or at least one fragment thereof comprising the steps of
culturing a host cell transformed with a DNA sequence encoding Japanese cedar
pollen allergen Cry j II or at least one fragment thereof in an appropriate medium to
produce a mixture of cells and medium containing said Japanese cedar pollen
allergen Cry j II or at least one fragment thereof; and purifying the mixture to
produce substantially pure Japanese cedar pollen allergen Cry j II or at least one
fragment thereof. Host cells transformed with an expression vector containing DNA
coding for Cry j II or at least one fragment thereof are cultured in a suitable medium
for the host cell. Cry j II protein and peptides can be purified from cell culture
medium, host cells, or both using techniques known in the art for purifying peptides
and proteins including ion-exchange chromatography, gel filtration chromatography,
ultrafiltration, electrophoresis and immunopurification with antibodies specific for
Cry j II or fragments thereof. The terms isolated and purified are used
interchangeably herein and refer to peptides, protein, protein fragments, and nucleic
acid sequences substantially free of cellular material or culture medium when
produced by recombinant DNA techniques, or chemical precursors.

Cry j II protein may also be isolated from Japanese cedar pollen as described
in Example 1. Cry j II isolated directly from Japanese cedar pollen is referred to
herein as "purified native" Cry j II. It is preferable that purified native Cry j II of
the invention be at least 80% pure, and more preferably at least 90% pure and even
more preferably be purified to homogeneity (at least 99% pure).

Another aspect of the invention provides preparations comprising Japanese
cedar pollen allergen Cry j II or at least one fragment thereof synthesized in a host
cell transformed with a DNA sequence encoding all or a portion of Japanese cedar
pollen allergen Cry j II, or chemically synthesized, and purified Japanese cedar
pollen allergen Cry j II protein, or at least one antigenic fragment thereof produced
in a host cell transformed with a nucleic acid sequence of the invention, or
chemically synthesized. In preferred embodiments of the invention the Cry j II
protein is produced in a host cell transformed with the nucleic acid sequence coding
for at least the mature Cry j II protein.

Fragments of an allergen from Cry j II eliciting a desired antigenic response
(referred to herein as antigenic fragments) are defined herein as any protein fragment
or peptide which can be derived from the Cry j II proteins, but does not include the
ten amino acid fragment which extends from amino acid residues 55-64, as shown
in Figs. 4, 5 and 6, but may include any portion of that ten amino acid fragment in
conjunction with another fragment derived from Cry j II. Antigenic fragments of
Cry j II may be obtained, for example, by screening peptides recombinantly
produced from the corresponding fragment of the nucleic acid sequence of the
invention coding for such peptides, or by screening peptides which have been
synthesized chemically using techniques known in the art, or by screening peptides
produced by chemical cleavage of the allergen. The allergen may be arbitrarily
divided into fragments of a desired length with no overlap of the peptides, or
preferably divided into overlapping fragments of a desired length. The fragments
are tested to determine their antigenicity (e.g. the ability of the fragment to induce
an immune response such as T cell proliferation as discussed in Example 7).
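
Dividing a sequence into non-overlapping or overlapping fragments of a desired
length, as described above, amounts to sliding a fixed-size window along the
sequence. The Python sketch below is one simple way to do this; the function name,
the peptide length and the overlap are arbitrary choices for illustration and are
not values taken from the specification.

```python
def tile_peptides(sequence, length, overlap=0):
    """Split a sequence into windows of `length` sharing `overlap` residues.

    Returns a list of (start_position, peptide) pairs with 1-based start
    positions. With overlap=0 the windows do not overlap. The final
    window may be shorter than `length` if the sequence runs out.
    """
    if length <= 0 or not 0 <= overlap < length:
        raise ValueError("need length > 0 and 0 <= overlap < length")
    step = length - overlap
    peptides = []
    for start in range(0, len(sequence), step):
        peptides.append((start + 1, sequence[start:start + length]))
        if start + length >= len(sequence):
            break  # the end of the sequence is already covered
    return peptides


if __name__ == "__main__":
    demo = "ABCDEFGHIJ"  # placeholder letters, not a real allergen sequence
    print(tile_peptides(demo, 4))             # no overlap
    print(tile_peptides(demo, 4, overlap=2))  # overlapping fragments
```
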

Antigenic fragments may also be predicted using an algorithm such as that
discussed in a paper by Hill et al., Journal of Immunology, 147: 184-197 (1991).
Algorithms for predicting peptides which elicit T cell activity such as the algorithm
discussed by Hill et al. are based on the protein's sequence wherein certain patterns
within the sequence are likely to bind MHC and therefore may contain T cell
epitopes. The peptides predicted by the algorithm such as Cry j IIA and Cry j IIB
discussed in Example 7 may be produced recombinantly or synthetically and tested
for T cell activity as discussed in Example 7.

If fragments of Japanese cedar pollen allergen, e.g. Cry j II, are to be used for
therapeutic purposes, then the fragments of Japanese cedar pollen allergen which are
capable of eliciting a T cell response such as stimulation (i.e., proliferation or
lymphokine secretion) and/or are capable of inducing T cell anergy are particularly
desirable and fragments of Japanese cedar pollen which have minimal IgE
stimulating activity are also desirable. Additionally, for therapeutic purposes,
purified Japanese cedar pollen allergens, e.g. Cry j II, and fragments thereof
preferably do not bind IgE specific for Japanese cedar pollen or bind such IgE to a
substantially lesser extent than the purified native Japanese cedar pollen allergen
binds such IgE. If the purified Japanese cedar pollen allergen or fragment or
fragments thereof bind IgE, it is preferable that such binding does not result in the
release of mediators (e.g. histamines) from mast cells or basophils. Minimal IgE
stimulating activity refers to IgE stimulating activity that is less than the amount of
IgE production stimulated by the native Cry j II protein.

Isolated antigenic fragments or peptides of the present invention which have
T cell stimulating activity, and thus comprise at least one T cell epitope, are
particularly desirable. T cell epitopes are believed to be involved in initiation and
perpetuation of the immune response to a protein allergen which is responsible for
the clinical symptoms of allergy. These T cell epitopes are thought to trigger early
events at the level of the T helper cell by binding to an appropriate HLA molecule
on the surface of an antigen presenting cell and stimulating the relevant T cell
subpopulation. These events lead to T cell proliferation, lymphokine secretion, local
inflammatory reactions, recruitment of additional immune cells to the site, and
activation of the B cell cascade leading to production of antibodies. One isotype of
these antibodies, IgE, is fundamentally important to the development of allergic
symptoms and its production is influenced early in the cascade of events, at the level
of the T helper cell, by the nature of the lymphokines secreted. An epitope is the
basic element or smallest unit of recognition by a receptor, particularly
immunoglobulins, histocompatibility antigens and T cell receptors, where the epitope
comprises amino acids essential to receptor recognition. Amino acid sequences
which mimic those of the epitopes, particularly T cell epitopes, and which modify the
allergic response to protein allergens, including those capable of down regulating
allergic response to Cry j II, are within the scope of this invention.

As discussed in Example 7, human T cell stimulating activity can be tested by
culturing T cells obtained from an individual sensitive to Japanese cedar pollen
allergen (i.e., an individual who has an IgE mediated immune response to Japanese
cedar pollen allergen) with a peptide derived from the allergen and determining
whether proliferation of T cells occurs in response to the peptide as measured, e.g.,
by cellular uptake of tritiated thymidine. Stimulation indices for responses by T
cells to peptides can be calculated as the maximum CPM in response to a peptide
divided by the control CPM. A stimulation index (S.I.) equal to or greater than two
times the background level is considered "positive". Positive results are used to
calculate the mean stimulation index for each peptide tested. Preferred peptides of
this invention comprise at least one T cell epitope and have a mean T cell stimulation
index of greater than or equal to 2.0. A peptide having a mean T cell stimulation
index of greater than or equal to 2.0 is considered useful as a therapeutic agent. As
shown in Fig. 17, Cry j II peptides Cry j IIA and Cry j IIB have mean stimulation
indexes of at least two and therefore comprise at least one T cell epitope as
predicted.
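
The stimulation index calculation described above (maximum CPM for a peptide
divided by the control CPM, with values of 2.0 or more counted as positive and
averaged) can be written out as follows. This Python sketch is illustrative: the
function names are hypothetical and the CPM values are invented for the example,
not data from Example 7 or Fig. 17.

```python
from statistics import mean


def stimulation_index(peptide_cpms, control_cpm):
    """Maximum CPM observed for a peptide divided by the control CPM."""
    if control_cpm <= 0:
        raise ValueError("control CPM must be positive")
    return max(peptide_cpms) / control_cpm


def mean_positive_si(indices, threshold=2.0):
    """Mean of the stimulation indices that meet the positive threshold.

    Returns None when no response is positive.
    """
    positives = [si for si in indices if si >= threshold]
    return mean(positives) if positives else None


if __name__ == "__main__":
    # Invented counts for one peptide at three doses in one patient.
    print(stimulation_index([1200, 4800, 3000], control_cpm=1000))
    # Invented per-patient indices for one peptide.
    print(mean_positive_si([4.8, 1.5, 3.2]))
```
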

Purified protein allergens from Japanese cedar pollen or preferred antigenic
fragments thereof, when administered to a Japanese cedar pollen-sensitive individual,
or an individual allergic to an allergen cross-reactive with Japanese cedar pollen
allergen, are capable of modifying the allergic response of the individual to Japanese
cedar pollen or such cross-reactive allergen of the individual, and preferably are
capable of modifying the B-cell response, T-cell response or both the B-cell and the
T-cell response of the individual to the allergen. As used herein, modification of
the allergic response of an individual sensitive to a Japanese cedar pollen allergen
can be defined as non-responsiveness or diminution in symptoms to the allergen, as
determined by standard clinical procedures (see e.g. Varney et al., British Medical
Journal 302: 265-269 (1990)) including diminution in Japanese cedar pollen
induced asthmatic symptoms. As referred to herein, a diminution in symptoms
includes any reduction in allergic response of an individual to the allergen after the
individual has completed a treatment regimen with a peptide or protein of the
invention. This diminution may be subjective (i.e. the patient feels more
comfortable in the presence of the allergen). Diminution in symptoms can be
determined clinically as well, using standard skin tests as is known in the art.

The purified Cry j II protein or fragments thereof are preferably tested in
mammalian models of Japanese cedar pollinosis such as the mouse model disclosed
in Tamura et al. (1986) Microbiol. Immunol. 30: 883-896, or U.S. patent
4,939,239; or the primate model disclosed in Chiba et al. (1990) Int. Arch. Allergy
Immunol. 93: 83-88. Initial screening for IgE binding to the protein or fragments
thereof may be performed by scratch tests or intradermal skin tests on laboratory
animals or human volunteers, or in in vitro systems such as RAST
(radioallergosorbent test), RAST inhibition, ELISA assay, radioimmunoassay (RIA),
or histamine release.

Exposure of allergic individuals to purified protein allergens of the present
invention or to the antigenic fragments of the present invention which comprise at
least one T cell epitope and are derived from protein allergens may tolerize or
anergize appropriate T cell subpopulations such that they become unresponsive to the
protein allergen and do not participate in stimulating an immune response upon such
exposure. In addition, administration of the protein allergen of the invention or an
antigenic fragment of the present invention which comprises at least one T cell
epitope may modify the lymphokine secretion profile as compared with exposure to
the naturally-occurring protein allergen or portion thereof (e.g. result in a decrease
of IL-4 and/or an increase in IL-2). Furthermore, exposure to such antigenic
fragment or protein allergen may influence T cell subpopulations which normally
participate in the response to the allergen such that these T cells are drawn away
from the site(s) of normal exposure to the allergen (e.g., nasal mucosa, skin, and
lung) towards the site(s) of therapeutic administration of the fragment or protein
allergen. This redistribution of T cell subpopulations may ameliorate or reduce the
ability of an individual's immune system to stimulate the usual immune response at
the site of normal exposure to the allergen, resulting in a diminution in allergic
symptoms.

The isolated Cry j II protein, and fragments or portions derived therefrom, can
be used in methods of diagnosing, treating and preventing allergic reactions to
Japanese cedar pollen allergen or a cross reactive protein allergen. Thus the present
invention provides therapeutic compositions comprising purified Japanese cedar
pollen allergen Cry j II or at least one fragment thereof produced in a host cell
transformed to express Cry j II or at least one fragment thereof, and a
pharmaceutically acceptable carrier or diluent. The therapeutic compositions of the
invention may also comprise synthetically prepared Cry j II or at least one fragment
thereof and a pharmaceutically acceptable carrier or diluent. Administration of the
therapeutic compositions of the present invention to an individual to be desensitized
can be carried out using known techniques. Cry j II protein or at least one fragment
thereof may be administered to an individual in combination with, for example, an
appropriate diluent, a carrier and/or an adjuvant. Pharmaceutically acceptable
diluents include saline and aqueous buffer solutions. Pharmaceutically acceptable
carriers include polyethylene glycol (Wie et al. (1981) Int. Arch. Allergy Appl.
Immunol. 64: 84-99) and liposomes (Strejan et al. (1984) J. Neuroimmunol. 7: 27).
For purposes of inducing T cell anergy, the therapeutic composition is preferably
administered in nonimmunogenic form, e.g. it does not contain adjuvant. Such
compositions will generally be administered by injection (subcutaneous, intravenous,
etc.), oral administration, inhalation, transdermal application or rectal
administration. The therapeutic compositions of the invention are administered to
Japanese cedar pollen-sensitive individuals at dosages and for lengths of time
effective to reduce sensitivity (i.e., reduce the allergic response) of the individual to
Japanese cedar pollen. Effective amounts of the therapeutic compositions will vary
according to factors such as the degree of sensitivity of the individual to Japanese
cedar pollen, the age, sex, and weight of the individual, and the ability of the Cry j
II protein or fragment thereof to elicit an antigenic response in the individual.

The Cry j II cDNA (or the mRNA from which it was transcribed) or a 
portion thereof can be used to identify similar sequences in any variety or type of 
plant and thus, to identify or "pull out" sequences which have sufficient homology to 
hybridize to the Cry j II cDNA or mRNA or portion thereof, for example, DNA 
from allergens of Cupressus sempervirens, Juniperus sabinoides, etc., under 
conditions of low stringency. Those sequences which have sufficient homology 
(generally greater than 40%) can be selected for further assessment using the method 
described herein. Alternatively, high stringency conditions can be used. In this 
manner, DNA of the present invention can be used to identify, in other types of 
plants, preferably related families, genera, or species such as Juniperus or 
Cupressus, sequences encoding polypeptides having amino acid sequences similar to 
that of Japanese cedar pollen allergen Cry j II, and thus to identify allergens in other 
species. Thus, the present invention includes not only Cry j II, but also other 
allergens encoded by DNA which hybridizes to DNA of the present invention. The 
invention further includes previously unidentified isolated allergenic proteins or 
fragments thereof that are immunologically related to Cry j II or fragments thereof, 
such as by antibody cross-reactivity, wherein the isolated allergenic proteins or 
fragments thereof are capable of binding to antibodies specific for the protein and 
peptides of the invention, or by T cell cross-reactivity, wherein the isolated 
allergenic proteins or fragments thereof are capable of stimulating T cells specific for 
the protein and peptides of this invention. 
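
The homology threshold mentioned above (generally greater than 40%) can be 
illustrated with a simple percent-identity calculation. The following is only a 
minimal sketch: it assumes two sequences that have already been aligned to equal 
length, and the function name percent_identity and both example sequences are 
hypothetical, not taken from the present disclosure. Whether two sequences actually 
hybridize at a given stringency depends on more than this one number. 

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of aligned positions at which two equal-length sequences match.

    Gap characters ('-') never count as matches.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1
        for a, b in zip(seq_a.upper(), seq_b.upper())
        if a == b and a != "-"
    )
    return 100.0 * matches / len(seq_a)


# Hypothetical aligned fragments, for illustration only.
query = "ATGGCTATTAATATTTTTAAT"
candidate = "ATGGCCATCAACATCTTCAAC"

identity = percent_identity(query, candidate)
passes_threshold = identity > 40.0  # the "generally greater than 40%" screen
```

In this sketch a candidate that clears the threshold would simply be passed on for 
further assessment; the cutoff itself is the only value taken from the text. 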

Proteins or peptides encoded by the cDNA of the present invention can be 
used, for example, as "purified" allergens. Such purified allergens are useful in the 
standardization of allergen extracts which are key reagents for the diagnosis and 
treatment of Japanese cedar pollinosis. Furthermore, by using peptides based on the 
nucleic acid sequences of Cry j II, anti-peptide antisera or monoclonal antibodies can 
be made using standard methods. These sera or monoclonal antibodies can be used 
to standardize allergen extracts. 

Through use of the peptides and protein of the present invention, preparations 
of consistent, well-defined composition and biological activity can be made and 
administered for therapeutic purposes (e.g. to modify the allergic response of a 
Japanese cedar sensitive individual to pollen of such trees). Administration of such 
peptides or protein may, for example, modify B-cell response to Cry j II allergen, 
modify T-cell response to Cry j II allergen or modify both B-cell and T-cell 
responses. Purified peptides can also be used to study the mechanism of 
immunotherapy of Cryptomeria japonica allergy and to design modified derivatives 
or analogues useful in immunotherapy. 

Work by others has shown that high doses of allergens generally produce the 
best results (i.e., best symptom relief). However, many people are unable to 
tolerate large doses of allergens because of allergic reactions to the allergens. 
Modification of naturally-occurring allergens can be designed in such a manner that 
modified peptides or modified allergens which have the same or enhanced 
therapeutic properties as the corresponding naturally-occurring allergen but have 
reduced side effects (especially anaphylactic reactions) can be produced. These can 
be, for example, a protein or peptide of the present invention (e.g., one having all or 
a portion of the amino acid sequence of Cry j II), or a modified protein or peptide, 
or protein or peptide analogue. 

It is possible to modify the structure of a protein or peptide of the invention 
for such purposes as increasing solubility, enhancing therapeutic or preventive 
efficacy, or stability (e.g., shelf life ex vivo, and resistance to proteolytic 
degradation in vivo). A modified protein or peptide can be produced in which the 
amino acid sequence has been altered, such as by amino acid substitution, deletion, 
or addition, to modify immunogenicity and/or reduce allergenicity, or to which a 
component has been added for the same purpose. For example, the amino acid 
residues essential to T cell epitope function can be determined using known 
techniques (e.g., substitution of each residue and determination of the presence or 
absence of T cell reactivity). 
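
The residue-by-residue substitution approach described above can be sketched as 
follows. This is an illustrative sketch only: the helper name 
single_substitution_variants is hypothetical, alanine is assumed as the replacement 
residue, and the parent peptide is simply the ten-residue Cry j II N-terminal 
stretch quoted in Example 2, which is not asserted here to be a T cell epitope. 
Each variant would then be tested for the presence or absence of T cell reactivity. 

```python
def single_substitution_variants(peptide: str, replacement: str = "A"):
    """Yield (position, original_residue, variant) for each single substitution.

    Positions are 1-based. Residues already equal to `replacement` are skipped,
    because substituting them would only reproduce the parent peptide.
    """
    for i, residue in enumerate(peptide):
        if residue == replacement:
            continue
        yield i + 1, residue, peptide[:i] + replacement + peptide[i + 1:]


# Illustrative parent peptide (see lead-in); not a claimed epitope.
parent = "AINIFNVEKY"
variants = list(single_substitution_variants(parent))

for position, original, variant in variants:
    print(f"{original}{position}A  {variant}")
```

A residue whose substitution abolishes reactivity in the assay would be a candidate 
"essential" residue in the sense used above. 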

For example, a peptide can be modified so that it maintains the ability 
to induce T cell anergy and bind MHC proteins without the ability to induce a strong 
proliferative response or possibly any proliferative response when administered in 
immunogenic form. In this instance, critical binding residues for the T cell receptor 
can be determined using known techniques (e.g., substitution of each residue and 
determination of the presence or absence of T cell reactivity). Those residues shown 
to be essential to interact with the T cell receptor can be modified by replacing the 
essential amino acid with another, preferably similar amino acid residue (a 
conservative substitution) whose presence is shown to enhance, or to diminish but 
not eliminate, binding to relevant MHC. 

Additionally, peptides of the invention can be modified by replacing an 
amino acid shown to be essential to interact with the MHC protein complex with 
another, preferably similar amino acid residue (conservative substitution) whose 
presence is shown to enhance, diminish but not eliminate, or not affect T cell 
activity. In addition, amino acid residues which are not essential for interaction with 
the MHC protein complex but which still bind the MHC protein complex can be 
modified by being replaced by another amino acid whose incorporation may 
enhance, not affect, or diminish but not eliminate T cell reactivity. Preferred amino 
acid substitutions for non-essential amino acids include, but are not limited to, 
substitutions with alanine, glutamic acid, or a methyl amino acid. 

Another example of a modification of protein or peptides is substitution of 
cysteine residues preferably with alanine, serine, threonine, leucine or glutamic acid 
to minimize dimerization via disulfide linkages. Another example of modification of 
the peptides of the invention is by chemical modification of amino acid side chains 
or cyclization of the peptide. 

In order to enhance stability and/or reactivity, the protein or peptides of the 
invention can also be modified to incorporate one or more polymorphisms in the 
amino acid sequence of the protein allergen resulting from natural allelic variation. 
Additionally, D-amino acids, non-natural amino acids or non-amino acid analogues 
can be substituted or added to produce a modified protein or peptide within the scope 
of this invention. Furthermore, proteins or peptides of the present invention can be 
modified using the polyethylene glycol (PEG) method of A. Sehon and co-workers 
(Wie et al., supra) to produce a protein or peptide conjugated with PEG. In addition, 
PEG can be added during chemical synthesis of a protein or peptide of the invention. 
Modifications of proteins or peptides or portions thereof can also include reduction/ 
alkylation (Tarr in: Methods of Protein Microcharacterization, J.E. Silver ed., 
Humana Press, Clifton, NJ, pp 155-194 (1986)); acylation (Tarr, supra); chemical 
coupling to an appropriate carrier (Mishell and Shiigi, eds., Selected Methods in 
Cellular Immunology, WH Freeman, San Francisco, CA (1980); U.S. Patent 
4,939,239); or mild formalin treatment (Marsh, International Archives of Allergy and 
Applied Immunology, 41:199-215 (1971)). 

To facilitate purification and potentially increase solubility of proteins or 
peptides of the invention, it is possible to add reporter group(s) to the peptide 
backbone. For example, poly-histidine can be added to a peptide to purify the 
peptide on immobilized metal ion affinity chromatography (Hochuli, E. et al., 
Bio/Technology, 6:1321-1325 (1988)). In addition, specific endoprotease cleavage 
sites can be introduced, if desired, between a reporter group and amino acid 
sequences of a peptide to facilitate isolation of peptides free of irrelevant sequences. 
In order to successfully desensitize an individual to a protein antigen, it may be 
necessary to increase the solubility of a protein or peptide by adding functional 
groups to the peptide or by not including hydrophobic T cell epitopes or regions 
containing hydrophobic epitopes in the peptides or hydrophobic regions of the 
protein or peptide. 

To potentially aid proper antigen processing of T cell epitopes within a 
peptide, canonical protease sensitive sites can be recombinantly or synthetically 
engineered between regions, each comprising at least one T cell epitope. For 
example, charged amino acid pairs, such as KK or RR, can be introduced between 
regions within a peptide during recombinant construction of the peptide. The 
resulting peptide can be rendered sensitive to cathepsin and/or other trypsin-like 
enzyme cleavage to generate portions of the peptide containing one or more T cell 
epitopes. In addition, such charged amino acid residues can result in an increase in 
solubility of a peptide. 
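
The effect of placing a charged pair between epitope-containing regions can be 
sketched in a few lines. This is a simplified model under stated assumptions: the 
function names are hypothetical, cleavage is modelled with the usual textbook 
trypsin rule (cut after K or R unless the next residue is P), and the two regions 
are arbitrary stretches chosen to contain no internal K or R rather than defined 
T cell epitopes. 

```python
def join_with_linker(regions, linker="KK"):
    """Concatenate epitope-containing regions with a charged-pair linker."""
    return linker.join(regions)


def trypsin_like_fragments(peptide: str):
    """Split after every K or R, except when the next residue is P.

    This is a simplified rule; real proteases are less tidy.
    """
    fragments, start = [], 0
    for i, residue in enumerate(peptide):
        next_residue = peptide[i + 1] if i + 1 < len(peptide) else ""
        if residue in "KR" and next_residue != "P":
            fragments.append(peptide[start:i + 1])
            start = i + 1
    if start < len(peptide):
        fragments.append(peptide[start:])
    return fragments


# Hypothetical regions, for illustration only.
regions = ["AINIFNVE", "GAVGDG"]
construct = join_with_linker(regions)          # regions separated by "KK"
fragments = trypsin_like_fragments(construct)  # regions released at the linker
```

Under this model the linker is consumed by cleavage: the first region keeps one 
trailing lysine, the second lysine is released on its own, and the second region 
is recovered intact. 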

Site-directed mutagenesis of DNA encoding a peptide or protein of the 
invention (e.g., Cry j II or a fragment thereof) can be used to modify the structure of 
the peptide or protein by methods known in the art. Such methods may, among 
others, include PCR with degenerate oligonucleotides (Ho et al., Gene, 77:51-59 
(1989)) or total synthesis of mutated genes (Hostomsky, Z. et al., Biochem. Biophys. 
Res. Comm., 161:1056-1063 (1989)). To enhance bacterial expression, the 
aforementioned methods can be used in conjunction with other procedures to change 
the eukaryotic codons in DNA constructs encoding protein or peptides of the 
invention to ones preferentially used in E. coli, yeast, mammalian cells, or other 
eukaryotic cells. 

Using the structural information now available, it is possible to design Cry j 
II peptides which, when administered to a Japanese cedar pollen sensitive individual 
in sufficient quantities, will modify the individual's allergic response to Japanese 
cedar pollen. This can be done, for example, by examining the structure of Cry j II, 
producing peptides (via an expression system, synthetically or otherwise) to be 
examined for their ability to influence B-cell and/or T-cell responses in Japanese 
cedar pollen sensitive individuals and selecting appropriate peptides which contain 
epitopes recognized by the cells. It is now also possible to design an agent or a drug 
capable of blocking or inhibiting the ability of Japanese cedar pollen allergen to 
induce an allergic reaction in Japanese cedar pollen sensitive individuals. Such 
agents could be designed, for example, in such a manner that they would bind to 
relevant anti-Cry j II IgEs, thus preventing IgE-allergen binding and subsequent mast 
cell degranulation. Alternatively, such agents could bind to cellular components of 
the immune system, resulting in suppression or desensitization of the allergic 
response to Cryptomeria japonica pollen allergens. A non-restrictive example of 
this is the use of appropriate B- and T-cell epitope peptides, or modifications 
thereof, based on the cDNA/protein structures of the present invention to suppress 
the allergic response to Japanese cedar pollen. This can be carried out by defining 
the structures of B- and T-cell epitope peptides which affect B- and T-cell function in 
in vitro studies with blood components from Japanese cedar pollen sensitive 
individuals. 

Protein, peptides or antibodies of the present invention can also be used for 
detecting and diagnosing Japanese cedar pollinosis. For example, this could be done 
by combining blood or blood products obtained from an individual to be assessed for 
sensitivity to Japanese cedar pollen with an isolated antigenic peptide or peptides of 
Cry j II, or isolated Cry j II protein, under conditions appropriate for binding of 
components in the blood (e.g., antibodies, T-cells, B-cells) with the peptide(s) or 
protein and determining the extent to which such binding occurs. Other diagnostic 
methods for allergic diseases in which the protein, peptides or antibodies of the present 
invention can be used include radio-allergosorbent test (RAST), paper 
radioimmunosorbent test (PRIST), enzyme linked immunosorbent assay (ELISA), 
radioimmunoassays (RIA), immuno-radiometric assays (IRMA), luminescence 
immunoassays (LIA), histamine release assays and IgE immunoblots. 

In another diagnostic test, the presence in individuals of IgE specific 
for Cry j II protein allergen and the ability of T cells of the individuals to 
respond to T cell epitope(s) of Cry j II protein allergen can be determined by 
administering to the individuals an Immediate Type Hypersensitivity test and a 
Delayed Type Hypersensitivity test. The individuals are administered an Immediate 
Type Hypersensitivity test (see e.g. Immunology (1985) Roitt, I.M., Brostoff, J., 
Male, D.K. (eds), C.V. Mosby Co., Gower Medical Publishing, London, NY, pp. 
19.2-19.18; pp. 22.1-22.10) utilizing the Cry j II protein allergen or a portion 
thereof, or a modified form of the Cry j II protein allergen or a portion thereof, 
each of which binds IgE specific for the allergen. The same individuals are 
administered a Delayed Type Hypersensitivity test prior to, simultaneously with, or 
subsequent to administration of the Immediate Type Hypersensitivity test. Of 
course, if the Immediate Type Hypersensitivity test is administered prior to the 
Delayed Type Hypersensitivity test, the Delayed Type Hypersensitivity test would be 
given to those individuals exhibiting a specific Immediate Type Hypersensitivity 
reaction. The Delayed Type Hypersensitivity test utilizes a modified form of the 
protein allergen or a portion thereof, the protein allergen produced recombinantly, or 
a recombitope peptide derived from the protein allergen, each of which has human T 
cell stimulating activity and each of which does not bind IgE specific for the allergen 
in a substantial percentage of the population of individuals sensitive to the allergen 
(e.g., at least about 75%). Based on the results of the above diagnostic tests, those 
individuals found to have both a specific Immediate Type Hypersensitivity reaction 
and a specific Delayed Type Hypersensitivity reaction are suitable candidates for 
administration of a therapeutically effective amount of a therapeutic composition. 
The therapeutic composition comprises the modified form of the protein or portion 
thereof, the recombinantly produced protein allergen, or the recombitope peptide, 
each as used in the Delayed Type Hypersensitivity test, and a pharmaceutically 
acceptable carrier or diluent. 

The present invention also provides a method of producing Cry j II or 
fragment thereof comprising culturing a host cell containing an expression vector 
which contains DNA encoding all or at least one fragment of Cry j II under 
conditions appropriate for expression of Cry j II or at least one fragment. The 
expressed product is then recovered, using known techniques. Alternatively, Cry j II 
or fragment thereof can be synthesized using known mechanical or chemical 
techniques. 

The DNA used in any embodiment of this invention can be cDNA obtained 
as described herein, or alternatively, can be any oligodeoxynucleotide sequence 
having all or a portion of a sequence represented herein, or their functional 
equivalents. Such oligodeoxynucleotide sequences can be produced chemically or 
enzymatically, using known techniques. A functional equivalent of an 
oligonucleotide sequence is one which is 1) a sequence capable of hybridizing to a 
complementary oligonucleotide to which the sequence (or corresponding sequence 
portions) of Cry j II or fragments thereof hybridizes, or 2) the sequence (or 
corresponding sequence portion) complementary to Cry j II, and/or 3) a sequence 
which encodes a product (e.g., a polypeptide or peptide) having the same functional 
characteristics as the product encoded by the sequence (or corresponding sequence 
portion) of Cry j II. Whether a functional equivalent must meet one or more of these 
criteria will depend on its use (e.g., if it is to be used only as an oligoprobe, it need 
meet only the first or second criterion, and if it is to be used to produce a Cry j II 
allergen, it need only meet the third criterion). 

The invention is further illustrated by the following non-limiting examples. 

Example 1 

Purification of Native Japanese Cedar Pollen Allergen (Cry j II) 

The following purification of native Cry j II from Japanese cedar pollen was 
modified from previously published reports (Yasueda et al., J. Allergy Clin. 
Immunol., 71:77 (1983); Sakaguchi et al., Allergy, 45:309 (1990)). 

100 g of Japanese cedar pollen obtained from Japan (Hollister-Stier, Spokane, 
WA) was defatted in 1 L diethyl ether three times; the pollen was collected after 
filtration and the ether was dried off in a vacuum. 

The defatted pollen was extracted at 4°C overnight in 2 L extraction buffer 
containing 50 mM Tris-HCl, pH 7.8, 0.2 M NaCl and protease inhibitors in final 
concentrations: soybean trypsin inhibitor (2 µg/mL), leupeptin (1 µg/mL), pepstatin 
A (1 µg/mL) and phenyl methyl sulfonyl fluoride (0.17 mg/mL). The insoluble 
material was re-extracted with 1.2 L extraction buffer at 4°C overnight and both 
extracts were combined together and depigmented by batch absorption with 
Whatman DE-52 (200 g dry weight) equilibrated with the extraction buffer. 
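
As a worked illustration of the buffer recipe, the mass of each protease inhibitor 
needed for a given extraction volume follows directly from the stated final 
concentrations. The sketch below assumes that the three low concentrations are in 
micrograms per mL (the unit symbols are partly illegible in the source) and uses a 
hypothetical helper name; stock solutions, solvents and order of addition are 
outside its scope. 

```python
# Final concentrations from the extraction buffer recipe, in mg per mL.
# Assumption: the three low values are read as micrograms per mL.
INHIBITORS_MG_PER_ML = {
    "soybean trypsin inhibitor": 0.002,        # 2 ug/mL
    "leupeptin": 0.001,                        # 1 ug/mL
    "pepstatin A": 0.001,                      # 1 ug/mL
    "phenyl methyl sulfonyl fluoride": 0.17,   # 0.17 mg/mL
}


def inhibitor_masses_mg(volume_ml: float) -> dict:
    """Mass of each inhibitor (mg) needed for `volume_ml` of buffer."""
    return {
        name: concentration * volume_ml
        for name, concentration in INHIBITORS_MG_PER_ML.items()
    }


first_extraction = inhibitor_masses_mg(2000)    # 2 L overnight extraction
second_extraction = inhibitor_masses_mg(1200)   # 1.2 L re-extraction

for name, mass in first_extraction.items():
    print(f"{name}: {mass:.1f} mg per 2 L")
```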

The depigmented material was then fractionated by ammonium sulfate 
precipitation at 80% saturation (4°C), which removed much of the lower molecular 
weight material. The resulting pellet was resuspended in 0.4 L of 50 mM Na- 
acetate, pH 5.0 containing protease inhibitors and was dialyzed extensively against 
the same buffer. 

The sample was further subjected to purification by either one of the two 
methods described below. 

Method A 

The sample was applied to a 100 mL DEAE cellulose column (Whatman DE- 
52) equilibrated at 4°C with 50 mM Na-acetate, pH 5.0 with protease inhibitors. 
The unbound material (basic proteins) from the DEAE cellulose column was then 
applied to a 50 ml cation exchange column (Whatman CM-52) which was 
equilibrated with 10 mM Na-acetate, pH 5.0 at 4°C with protease inhibitors. A 
linear gradient of 0-0.3 M NaCl was used to elute the proteins. The early fractions 
were enriched in Cry j I whereas the later fractions were enriched in Cry j II. 
Fractions containing Cry j II were pooled and next applied to a 1 mL Mono S HR 
5/5 column (Pharmacia, Piscataway, NJ) in 10 mM Na-acetate, pH 5.0, and proteins 
were eluted with a linear gradient of NaCl at room temperature. Residual Cry j I 
was eluted at ~0.2 M NaCl and Cry j II was eluted between 0.3 and 0.4 M NaCl. 
The Cry j II peak was pooled, concentrated twofold by lyophilization and 
subjected to gel filtration chromatography. 

The sample was applied to an FPLC Superdex 75 16/60 column (Pharmacia, 
Piscataway, NJ) in 10 mM acetate buffer, pH 5.0 and 0.15 M NaCl at a flow rate of 
30 ml/min. at room temperature. Purified Cry j II was recovered in the 35-30 kD 
region. Cry j II migrated as two broad bands lower than Cry j I under non-reducing 
conditions (Fig. 1a) but both bands shifted upward and migrated as Cry j I under 
reducing conditions (Fig. 1b) when analyzed by silver-stained SDS-PAGE. This 
highly purified Cry j II still contained a small amount (~5%) of Cry j I as detected by 
Western blot using MAb CBF2, which has been shown to bind to Cry j I, and by N- 
terminal protein sequencing. This Cry j II preparation was used to generate primary 
protein sequence of Cry j II as described below. 

Method B 

The dialyzed sample from the ammonium sulfate precipitation was applied at 
1 ml/min to a 5.0 ml Q-Sepharose Econo-Pac anion exchange cartridge (BioRad, 
Richmond, CA) equilibrated with 50 mM Na-acetate, pH 5.0 with protease 
inhibitors at 4°C. Elution was performed with the above buffer containing 0.5 M 
NaCl. The basic unbound material was then applied to a 5.0 ml CM-Sepharose 
Econo-Pac cation exchange cartridge (BioRad, Richmond, CA) equilibrated in 50 mM 
sodium acetate pH 5.0 with protease inhibitors. Basic proteins were eluted with a 
linear gradient up to 0.1 M sodium phosphate pH 7.0, 0.3 M NaCl at 1 ml/min at 
4°C. A Cry j II-enriched peak was collected late in the gradient and further 
purified by gel filtration chromatography. 

FPLC gel filtration was performed using a 320 mL Superdex 75 26/60 
(Pharmacia, Piscataway, NJ) column at 0.5 ml/min in 20 mM sodium acetate, pH 
5.0, in the presence of 0.15 M NaCl. The major peak containing mostly Cry j II 
eluted between 160 and 190 ml. Contaminating Cry j I was next removed by FPLC 
using a 1.0 ml Mono S 5/5 (Pharmacia, Piscataway, NJ) cation exchange column 
equilibrated with 10 mM sodium acetate pH 5.0. A stepwise gradient of 0-1 M 
NaCl was utilized by holding isocratically at 0.2 M, 0.3 M, 0.4 M and 1 M salt 
concentration. 

Multiple peaks (up to nine peaks) were obtained (Fig. 2) and analyzed by 
silver stained SDS-PAGE under reducing conditions (Fig. 3). Cry j I, with a 
reported pI of 8.6-8.9 (Yasueda et al., J. Allergy Clin. Immunol., vol. 71 (1983)), 
eluted in the earlier peaks and displayed a molecular weight of about 40 kD. Cry j II 
was purified to homogeneity as two bands (Fig. 3) and eluted in the later multiple 
peaks, suggesting the existence of isoforms. ELISA analysis using the mouse 
monoclonal 8B11 IgG antibody, which was raised against biochemically purified Cry 
j I, confirmed the absence of Cry j I in these purified Cry j II preparations. This 
purified Cry j II was used in the human IgE reactivity studies (Example 6). 

Physical properties of Cry j II 

The physicochemical properties of Cry j II were studied and are summarized 
below. Under non-reducing SDS-PAGE conditions Cry j II consists of two bands 
with molecular weights ranging from 34,000 to 32,000. The molecular weights of both 
bands are shifted higher, to about 38-36 kD, under reducing conditions (Fig. 1b). This 
shift in SDS-polyacrylamide gel has also been observed by others (Sakaguchi et al., 
Allergy, 45:309-312 (1990)). These results suggest that intrachain disulfide bonds are 
probably present in the protein, which is supported by the present finding that 
cloned Cry j II contains 20 cysteines deduced from the nucleotide sequence (Example 
3). The pI of Cry j II estimated from an IEF gel is about 10. The purified Cry j II 
binds human IgE of some allergic patients. 

The two molecular weight bands of Cry j II were separated on a 12% SDS- 
polyacrylamide gel and were then electroblotted onto PVDF membrane (Applied 
Biosystems, Foster City, CA). The blot was stained with Coomassie brilliant blue, 
and the bands were cut out and subjected to N-terminal amino acid sequencing 
(Example 2). The results showed that the upper and lower molecular weight bands 
had identical N-terminal sequences except that the lower molecular weight band 
lacked the first five amino acids. The estimated molecular weight of the upper band 
based on the cDNA sequence is about 52,000, which is significantly higher than the 
molecular weight estimated from SDS-polyacrylamide gel either in the presence or 
absence of reducing reagent. It is also higher than that obtained from gel filtration 
and preliminary mass spectroscopy analysis. There are several possibilities to 
account for this difference. One possibility is that Cry j II protein is processed. It is 
probable that the N-terminus and C-terminus of the protein are cleaved. It is not 
clear at the present time whether this processing occurs in the cell or is due to 
proteolysis during purification, even though four different protease inhibitors were 
added in most of the purification steps. Nevertheless, the two N-terminal sequences 
obtained from the purified Cry j II (Example 2) also contained the N-terminal 
sequence (10 amino acids) published by Sakaguchi et al. (Allergy, 45:309-312 
(1990)), suggesting that the N-terminus of Cry j II is probably hydrolyzed. Since 
Sakaguchi et al. (supra) did not use any protease inhibitors in their purification, a 
higher degree of hydrolysis might have occurred. This could explain why the 
N-terminal amino acid sequence that Sakaguchi et al. obtained was downstream of 
the N-terminal sequences as discussed in Example 2. 

Another approach which may be used to purify native Cry j II or recombinant 
Cry j II is immunoaffinity chromatography. This technique provides a very selective 
protein purification due to the specificity of the interaction between monoclonal 
antibodies and antigen. Murine polyclonal and monoclonal antibodies are generated 
against purified Cry j II. These antibodies are used for purification, 
characterization, analysis and diagnosis of the allergen Cry j II. 

Example 2 

Protein Sequencing of Purified Cry j II 

Cry j II protein was isolated as in Example 1. The doublet band shown on 
SDS-PAGE (Fig. 1a) was electroblotted onto ProBlott (Applied Biosystems, Foster 
City, CA). Sequencing was performed with the Beckman/Porton Microsequencer 
(model LF3000, Beckman Instruments, Carlsbad, CA), a Programmable Solvent 
Module (Beckman System Gold Model 126, Beckman Instruments, Carlsbad, CA) 
and a Diode Array Detector Module for PTH-amino acid detection (Beckman System 
Gold Model 168, Beckman Instruments, Carlsbad, CA) following the manufacturer's 
specifications. 

A single N-terminal sequence analysis of the upper doublet band and multiple 
N-terminal sequence analyses of the lower doublet band showed that both bands 
contained two N-termini, designated "long" and "short". The lower doublet band 
contained approximately 3.3 picomoles of the long form and 8.3 picomoles of the 
short form. This difference in yields was sufficient to make sequence assignments 
according to the quantitation at each sequencer cycle. The upper doublet band 
contained approximately 8.3 picomoles of both sequences. The revealed "long" 
sequence was NH2-RKVEHSRHDAINIFNVEKYGAVGDGKHDCTEAFSTAW(Q) 
( )( )( )KNP( )-COOH (SEQ ID NO: 4), where (Q) indicates a tentative 
identification of glutamine at position 38 and ( ) indicates unknown residues at 
positions 39-41 and 45. The revealed "short" sequence was NH2- 
SRHDAINIFNVEKYGAVGDGKHDCTEAFSTAWS-COOH (SEQ ID NO: 5). 
Thus the long Cry j II sequence had five more amino-terminal residues than the 
short form, and the sequence of the short form exactly matched that of the long form. 
In addition, both the long and short forms of Cry j II contained the ten amino acids, 
NH2-AINIFNVEKY-COOH (SEQ ID NO: 6), previously described for Cry j II 
(Sakaguchi et al. 1990, supra). The previously published ten amino acids 
(Sakaguchi et al. 1990, supra) correspond to amino acids ten through 19 of the long 
form described above. 
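
The positional relationships stated above can be checked with a short script. This 
is a sketch under one explicit assumption: the long form is modelled as the five 
extra residues (RKVEH) followed by the short form, and only the first 37 positions 
are used, since position 38 is a tentative call and positions 39-41 and 45 were 
not identified. The helper name one_based_span is hypothetical. 

```python
# Sequences as reported in Example 2 (one-letter code).
SHORT_FORM = "SRHDAINIFNVEKYGAVGDGKHDCTEAFSTAWS"  # SEQ ID NO: 5
PUBLISHED_10MER = "AINIFNVEKY"                    # SEQ ID NO: 6
LONG_FORM_EXTRA = "RKVEH"                         # five extra N-terminal residues

# Assumption: long form = extra residues + short form. Only the unambiguous
# region (long-form positions 1-37) is relied on below.
long_form_prefix = LONG_FORM_EXTRA + SHORT_FORM[:32]


def one_based_span(sequence: str, motif: str):
    """Return the 1-based (start, end) of the first occurrence of motif."""
    index = sequence.find(motif)
    if index < 0:
        raise ValueError("motif not found")
    return index + 1, index + len(motif)


span_in_long = one_based_span(long_form_prefix, PUBLISHED_10MER)
span_in_short = one_based_span(SHORT_FORM, PUBLISHED_10MER)
offset = span_in_long[0] - span_in_short[0]  # extra N-terminal residues
```

The span found in the long form agrees with the statement that the published ten 
amino acids correspond to amino acids ten through 19, and the offset between the 
two spans equals the five additional residues of the long form. 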

Example 3 

Extraction of RNA From Japanese Cedar Pollen and Staminate Cones and 
Cloning of Cry j II 

Fresh pollen and staminate cone samples, collected from a single 
Cryptomeria japonica (Japanese Cedar) tree at the Arnold Arboretum (Boston, MA), 
were frozen immediately on dry ice. RNA was prepared from 500 mg of each 
sample, essentially as described by Frankis and Mascarenhas (1980) Ann. Bot. 45: 
595-599. The samples were ground by mortar and pestle on dry ice and suspended 
in 5 ml of 50 mM Tris pH 9.0 with 0.2 M NaCl, 1 mM EDTA, 0.1% SDS that had 
been treated overnight with 0.1% diethyl pyrocarbonate (DEPC). After five 
extractions with phenol/chloroform/isoamyl alcohol (mixed 25:24:1), the RNA was 
precipitated from the aqueous phase with 0.1 volume 3 M sodium acetate and 2 
volumes ethanol. The pellets were recovered by centrifugation, resuspended in 2 ml 
dH2O and heated to 65°C for 5 minutes. Two ml 4 M lithium chloride was added to 
the preparation and the RNA was precipitated overnight at 0°C. The RNA pellets 
were recovered by centrifugation, resuspended in 1 ml dH2O, and again precipitated 
with 3 M sodium acetate and ethanol on dry ice for one hour. The final pellet was 
washed with 70% ethanol, air dried and resuspended in 100 µl DEPC-treated dH2O 
and stored at -80°C. 

Double stranded cDNA was synthesized from 4 µg pollen RNA or 8 µg 
flowerhead RNA using a commercially available kit (cDNA Synthesis System kit, 
BRL, Gaithersburg, MD). The double-stranded cDNA was phenol extracted, 
ethanol precipitated, blunted with T4 DNA polymerase (Promega, Madison, WI), 
and then ligated to ethanol precipitated, self annealed, AT and AL oligonucleotides 
for use in a modified Anchored PCR reaction, according to the method of Rafnar et 
al. (1990) J. Biol. Chem. 266: 1229-1236; Frohman et al. (1990) Proc. Natl. 
Acad. Sci. USA 85: 8998-9002; and Roux et al. (1990) BioTech. 8: 48-57. 
Oligonucleotide AT has the sequence (SEQ ID NO: 10) 
5'-GGGTCTAGAGGTACCGTCCGTCCGATCGATCATT-3' (Rafnar et al., 
supra). Oligonucleotide AL has the sequence (SEQ ID NO: 11) 
5'-AATGATCGATGCT-3' (Rafnar et al., supra). 

The first attempts at amplifying the amino terminus of Cry j II from the 
linkered cDNA (2 µl of a 20 µl reaction) were made using the degenerate 
oligonucleotide CP-11 and oligonucleotide AP. CP-11 has the sequence (SEQ ID 
NO: 12) 5'-ATACTTCTCIACGTTGAA-3', wherein A at position 1 can be G, C at 
position 4 can be T, C at position 7 can be T, I at position 10 is inosine to reduce 
degeneracy (Knoth et al. (1988) Nucleic Acids Res. 16: 10932), G at position 13 can 
be A, and G at position 16 can be A. AP, which has the sequence (SEQ ID NO: 
13) 5'-GGGTCTAGAGGTACCGTCCG-3', corresponds to nucleotides 1 through 
20 of the oligonucleotide AT. CP-11 is the degenerate oligonucleotide sequence that 
is complementary to the coding strand sequence substantially encoding amino acids 
PheAsnValGluLysTyr (SEQ ID NO: 14) (amino acids 59 to 64 of Fig. 4), which 
correspond to the carboxy terminus of the previously published Cry j II sequence 
(Sakaguchi et al., supra) shown in Fig. 4. All oligonucleotides were synthesized by 
Research Genetics Inc., Huntsville, AL. 
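
The relationship between CP-11 and the peptide it targets can be verified by 
expanding the degenerate positions, taking the reverse complement of each variant, 
and translating it. The sketch below uses the CP-11 sequence and alternative bases 
exactly as stated above and the standard genetic code; the function names are 
hypothetical, and treating inosine as "any base" is a simplifying assumption about 
its pairing behaviour. 

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")

CP11 = "ATACTTCTCIACGTTGAA"
# 1-based position -> allowed bases, as stated in the text.
# Inosine (I, position 10) is modelled here as any of the four bases.
ALTERNATIVES = {1: "AG", 4: "CT", 7: "CT", 10: "ACGT", 13: "GA", 16: "GA"}


def expand(primer: str, alternatives: dict):
    """Yield every concrete sequence represented by a degenerate primer."""
    choices = [
        alternatives.get(position, base)
        for position, base in enumerate(primer, start=1)
    ]
    for combination in product(*choices):
        yield "".join(combination)


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring any trailing partial codon."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


variants = list(expand(CP11, ALTERNATIVES))
encoded = {translate(reverse_complement(variant)) for variant in variants}
```

Every expanded variant is the complement of a coding sequence for the same 
hexapeptide, PheAsnValGluLysTyr (FNVEKY in one-letter code), which is what makes 
the oligonucleotide usable as an antisense primer against the published 
N-terminal sequence. 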

Polymerase chain reactions (PCR) were carried out using a commercially 
available kit (GeneAmp DNA Amplification kit, Perkin Elmer Cetus, Norwalk, CT) 
whereby 10 µl 10x buffer containing dNTPs was mixed with 100 pmoles of each 
oligonucleotide, cDNA (3-5 µl of a 20 µl first strand cDNA reaction mix), 0.5 µl 
Amplitaq DNA polymerase, and distilled water to 100 µl. 

The samples were amplified with a programmable thermal controller (MJ 
Research, Inc., Cambridge, MA). The first 5 rounds of amplification consisted of 
denaturation at 94°C for 1 min, annealing of primers to the template at 45°C for 1 
min, and chain elongation at 72°C for 1 min. The final 20 rounds of amplification 
consisted of denaturation as above, annealing at 55°C for 1 min, and elongation as 
above. The primary PCR reaction was carried out with 100 pmol each of the 
oligonucleotides AP and CP-11. Five percent (5 µl) of this initial amplification was 
then used in a secondary amplification with 100 pmoles each of AP and CP-12. 
CP-12 has the sequence (SEQ ID NO: 15) 
5'-CCTGCAGTACTTCTCIACGTTGAAIAT-3', wherein C at position 10 can be T, C 
at position 13 can be T, I at positions 16 and 25 are inosines to reduce degeneracy 
as above, G at position 19 can be A, and G at position 22 can be A. The sequence 
(SEQ ID NO: 16) 5'-CCTGCAG-3' (bases 1 through 7 of CP-12) represents a Pst I 
site added for cloning purposes; the remaining degenerate oligonucleotide sequence is 
complementary to the coding strand sequence that substantially encodes the amino 
acids IlePheAsnValGluLysTyr (SEQ ID NO: 17) (amino acids 58-64 of Fig. 4). 
Amplified DNA was recovered by sequential chloroform, phenol, and chloroform 
extractions, followed by precipitation on dry ice with 0.5 volumes of 7.5M 
ammonium acetate and 1.5 volumes of isopropanol. After precipitation and washing 
with 70% ethanol, the DNA was simultaneously digested with Xba I and Pi/ 1 m a 
50 fil reaction, precipitated to reduce the volume to 10 ^l, and electrophoresed 
through a preparative 2% GTG NuSeive low melt gel (FMC, Rockport, ME). The 
appropriate sized DNA area was visualized by ethidhmi bromide (EtBr) staimng, 
excised, and ligated into appropriately digested pUC19 for sequencmg by the 
dideoxy chain termmation method of Sanger et al. (1977) Proc. Natl. Acad. Sci. 
USA 74: 5463-5476) using a commercially available sequencmg kit (Sequenase kit, 
U.S. Biochemicals, Cleveland, OH). All resultant clones were sequenced, and none 
were found to contam Cry j n sequence. An alternate 2° PCR reaction was 
performed with AP and the nested oligonucleotide CP-2L CP-21 has the sequence 
(SEQ ID NO: 18) 5'-CCTGCAGTACTTCTCIACGTTGAAGAT-3' wherem C at 
position 10 can be T, C at position 13 can be T, I at position 16 is inosine to reduce 
degeneracy as above, G at position 19 can be A, G at position 22 can be A, and G at 
position 25 can be A or T. The sequence (SEQ ID NO: 16) 5'-CCTGCAG-3' (bases 
1 through 7 of CP-21) represent a Pst I site added for cloning purposes; the 
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remaining degenerate oligonucleotide sequence is the non-coding strand sequence 
corresponding to coding strand sequence substantially encoding amino acids 
DePheAsnValGluLysTyr (SEQ ID NO: 17) (amino acids 58 to 64 of Fig. 4). 
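
The relationship between a non-coding strand primer and the peptide it targets can be checked with a short script. The following is a minimal sketch, not part of the original procedure: it takes the CP-21 sequence as given above, removes the 7-base Pst I tail, reverse-complements the remainder, and translates the complete codons. Treating inosine (I) as pairing with any base (written N) is an assumption made here for illustration, and all function names are hypothetical.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: inosine (I) is treated as pairing with any base, written N.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G", "I": "N", "N": "N"}


def reverse_complement(seq):
    """Return the reverse complement of a primer written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def translate(seq):
    """Translate complete codons only; a trailing partial codon is ignored.

    A codon containing N is resolved only when every base gives the same
    residue (for example GTN is always Val); otherwise 'X' is reported.
    """
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        options = {CODON_TABLE[codon.replace("N", base)] for base in BASES}
        residues.append(options.pop() if len(options) == 1 else "X")
    return "".join(residues)


CP21 = "CCTGCAGTACTTCTCIACGTTGAAGAT"  # as written in the text, 5' to 3'
PST_I_TAIL = 7                         # bases 1 through 7 carry the Pst I site

coding = reverse_complement(CP21[PST_I_TAIL:])
peptide = translate(coding)
print(coding, peptide)
```

The six complete codons translate to Ile-Phe-Asn-Val-Glu-Lys in one-letter code, and the two remaining bases (TA) are the start of the Tyr codon, in agreement with the stated target IlePheAsnValGluLysTyr.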

A primary PCR was also performed on double-stranded, linkered cDNA
using CP-23D and AP, as above, to attempt to amplify the 3' end of the Cry j II
cDNA. A secondary PCR was performed using 5% of the primary reaction, using
CP-24D and AP. CP-23D (sequence (SEQ ID NO: 19) 5'-GCIATTAATATTTTTAA-3',
wherein the T at position 6 can be C or A, T at position 9 can be C, T at
position 12 can be C or A, and T at position 15 can be C) is the coding strand
sequence substantially encoding amino acids AlaIleAsnIlePheAsn (SEQ ID NO: 20)
(amino acids 55 to 60 of Fig. 4); CP-24D (SEQ ID NO: 21) (sequence
5'-GGAATTCCGCIATTAATATTTTTAATGT-3', wherein the T at position 14 can be C or A,
T at position 17 can be C, T at position 20 can be C or A, T at position 23 can
be C, and T at position 26 can be C) contains the sequence 5'-GGAATTCC-3' (SEQ ID
NO: 22) (bases 1 through 8 of CP-24D), which represents an Eco RI site added for
cloning purposes. The remaining degenerate oligonucleotide sequence of CP-24D
substantially encodes amino acids AlaIleAsnIlePheAsnVal (SEQ ID NO: 23) (amino
acids 55 to 61 of Fig. 4). Again, multiple clones were sequenced, none of which
could be identified as Cry j II, and this approach was not pursued further.

Upon the characterization of novel Cry j II protein sequence data described in
Example 2, new degenerate oligonucleotides for cloning Cry j II were designed and
synthesized. All oligonucleotides mentioned hereafter were synthesized on an ABI
394 DNA/RNA Synthesizer (Applied Biosystems, Foster City, CA), and purified on
NAP-10 columns (Pharmacia, Uppsala, Sweden) as per the manufacturers'
instructions. Degenerate oligonucleotide CP-35 was used with AP on the
double-stranded linkered cDNA in a primary PCR reaction carried out as described
herein. CP-35 has the sequence (SEQ ID NO: 24) 5'-GCTTCGGTACAATCATGTTT-3',
wherein T at position 3 can also be C; G at position 6 can also be A, T or C; A at
position 9 can also be G; A at position 12 can also be G; A at position 15 can
also be G; and T at position 18 can also be C; this degenerate oligonucleotide
sequence is the non-coding strand sequence corresponding to coding strand sequence
substantially encoding amino acids LysHisAspCysThrGluAla of Cry j II (SEQ ID NO:
25) (amino acids 71 to 77 of Fig. 4). Five percent (5 µl) of this initial
amplification, designated JC136, was then used in a secondary amplification with
100 pmoles each of AP and degenerate Cry j II primer CP-36, an internally nested
Cry j II oligonucleotide primer with the sequence (SEQ ID NO: 26)
5'-GGCTGCAGGTACAATCATGTTTGCCATC-3', wherein A at position 11 can also be G; A at
position 14 can also be G; A at position 17 can also be G; T at position 20 can
also be C; G at position 23 can also be A, T, or C; and A at position 26 can also
be G. The nucleotides 5'-GGCTGCAG-3' (SEQ ID NO: 27) (bases 1 through 8 of CP-36)
represent a Pst I restriction site added for cloning purposes. The remaining
degenerate oligonucleotide sequence of CP-36 is the non-coding strand sequence
corresponding to coding strand sequence substantially encoding amino acids
AspGlyLysHisAspCysThr of Cry j II (SEQ ID NO: 28) (amino acids 69 to 75 of Fig.
4). The dominant amplified product, designated JC137, was a DNA band of
approximately 265 base pairs, as visualized on an EtBr-stained 2% GTG agarose gel.
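
The degeneracy of a primer such as CP-36 is the number of distinct sequences in the synthesized pool, which is the product of the number of bases allowed at each variable position. The sketch below is illustrative only: the site table is transcribed from the description of CP-36 above (first letter is the base written in the sequence, the rest are the stated alternatives), and the names `CP36_SITES` and `degeneracy` are hypothetical.

```python
from math import prod

CP36 = "GGCTGCAGGTACAATCATGTTTGCCATC"  # as written in the text, 5' to 3'

# 1-based position -> allowed bases; the first letter is the base written
# in CP36, the remaining letters are the alternatives named in the text.
CP36_SITES = {
    11: "AG",
    14: "AG",
    17: "AG",
    20: "TC",
    23: "GATC",
    26: "AG",
}


def degeneracy(seq, sites):
    """Return the number of distinct sequences in a degenerate primer pool."""
    for position, allowed in sites.items():
        written = seq[position - 1]
        if written != allowed[0]:
            raise ValueError(
                f"position {position}: expected {allowed[0]}, found {written}"
            )
    return prod(len(allowed) for allowed in sites.values())


print(degeneracy(CP36, CP36_SITES))
```

With five two-fold positions and one four-fold position, the pool contains 128 sequences under these assumptions.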

Amplified DNA was recovered by sequential chloroform, phenol, and
chloroform extractions, followed by precipitation at -20°C with 0.5 volumes of 7.5
M ammonium acetate and 1.5 volumes of isopropanol. After precipitation and washing
with 70% ethanol, the DNA was simultaneously digested with Xba I and Pst I in a
15 µl reaction and electrophoresed through a preparative 2% GTG SeaPlaque low
melt gel (FMC, Rockport, ME). The appropriate sized DNA band was visualized
by EtBr staining, excised, and ligated into appropriately digested pUC19 for
sequencing by the dideoxy chain termination method (Sanger et al. (1977) Proc.
Natl. Acad. Sci. USA 74: 5463-5476) using a commercially available sequencing kit
(Sequenase kit, U.S. Biochemicals, Cleveland, OH).

The clones designated pUC19JC137a, pUC19JC137b, and pUC19JC137e
were found to contain sequences encoding the amino terminus of Cry j II. All
three clones had identical sequence in their regions of overlap, although all three
clones had different lengths in the 5' untranslated region. Clone pUC19JC137b
was the longest clone. The translated sequence of these clones had complete
identity to the disclosed 10 amino acid sequence of Cry j II (Sakaguchi et al.,
supra), as well as to the Cry j II amino acid sequence described in Example 2.
Amino acid numbering is based on the sequence of the full length protein; amino
acid 1 corresponds to the initiating methionine (Met) of Cry j II. The position of
the initiating Met was supported by the presence of an upstream in-frame stop
codon and by 78% homology of the surrounding nucleotide sequence with the
plant consensus sequence that encompasses the initiating Met, as reported by
Lutcke et al. (1987) EMBO J. 6:43-48.

The cDNA encoding the remainder of the Cry j II gene was cloned from the
linkered cDNA by using oligonucleotides CP-37 (SEQ ID NO: 29) (which has the
sequence 5'-ATGTTGGACAGTGTTGTCGAA-3') and AP in a primary PCR,
designated JC138ii. Oligonucleotide CP-37 corresponds to nucleotides 129 to 149 of
Fig. 4, and is based on the nucleotide sequence determined for the partial Cry j II
clone pUC19JC137b.

A secondary PCR reaction was performed on 5% of the initial amplification
mixture, with 100 pmoles each of AP and CP-38 (SEQ ID NO: 30) (which has the
sequence 5'-GGGAATTCAGAAAAGTTGAGCATTCTCGT-3'), the nested primer.
The nucleotide sequence (SEQ ID NO: 31) 5'-GGGAATTC-3' (bases 1 through 8 of
CP-38) represents an Eco RI restriction site added for cloning purposes. The
remaining oligonucleotide sequence corresponds to nucleotides 177 to 197 of Fig. 4,
and is based on the nucleotide sequence determined for the partial Cry j II clone
pUC19JC137b. The amplified DNA product, designated JC140iii, was purified and
precipitated as above, followed by digestion with Eco RI and Asp 718 and
electrophoresis through a preparative 1% low melt gel. The dominant DNA band,
which was approximately 1.55 kb in length, was excised and ligated into pUC19 for
sequencing. DNA was sequenced by the dideoxy chain termination method (Sanger
et al. supra) using a commercially available kit (Sequenase kit, U.S. Biochemicals,
Cleveland, OH). Both strands were completely sequenced using M13 forward and
reverse primers (N.E. Biolabs, Beverly, MA) and internal sequencing primers
CP-35, CP-38, CP-40, CP-41, CP-42, CP-43, CP-44, CP-45, CP-46, CP-47, CP-48,
CP-49, CP-50, and CP-51. CP-40 (SEQ ID NO: 32) has the sequence
5'-GTTCTTCAATGGGCCATGT-3' and corresponds to nucleotides 359 to 377 of Fig.
4. CP-41 (SEQ ID NO: 33) has the sequence 5'-GTGTTAGGACTGTCTCTCGG-3',
which is the non-coding strand sequence that corresponds to
nucleotides 720 to 739 of Fig. 4. CP-42 (SEQ ID NO: 34) has the sequence
5'-TGTCCAGGCCATGGAATAAG-3', which corresponds to nucleotides 864 to
883 of Fig. 4 except that the first nucleotide was synthesized as a T rather than the
correct G. CP-43 has the sequence (SEQ ID NO: 35)
5'-GCCTTACATGGACTGCAACC-3', which is the non-coding strand sequence that
corresponds to nucleotides 1476 to 1495 of Fig. 4. CP-44 has the sequence (SEQ
ID NO: 36) 5'-TCCACGGGTCTGATAATCCA-3', which corresponds to
nucleotides 612 to 631 of Fig. 4. CP-45 has the sequence (SEQ ID NO: 37)
5'-AGGCAGGAAGCAATTTTCCC-3', which is the non-coding strand sequence
that corresponds to nucleotides 1254 to 1273 of Fig. 4. CP-46 has the sequence
(SEQ ID NO: 38) 5'-TACTGCACTTCAGCTTCTGC-3', which corresponds to
nucleotides 1077 to 1096 of Fig. 4. CP-47 has the sequence (SEQ ID NO: 39)
5'-GGGGGTCTCCGAATTTATCA-3', which is the non-coding strand sequence that
substantially corresponds to nucleotides 1039 to 1058 of Fig. 4, except that the fifth
nucleotide of CP-47 was synthesized as a G rather than the correct nucleotide, T.
CP-48 (SEQ ID NO: 40), which has the sequence
5'-GGATATTTCAGTGGACACGT-3', corresponds to nucleotides 1290 to 1309 of
Fig. 4. CP-49 (SEQ ID NO: 41) has the sequence
5'-TATTAGAAGACCCTGTGCCT-3', which is the non-coding strand sequence that
corresponds to nucleotides 821 to 840 of Fig. 4. CP-50 (SEQ ID NO: 42) has the
sequence 5'-CCATGTAAGGCCAAGTTAGT-3', which corresponds to nucleotides 1485 to
1504 of Fig. 4. CP-51 (SEQ ID NO: 43) has the sequence
5'-ACACCTTTACCCATTAGAGT-3', which is the non-coding strand sequence that
corresponds to nucleotides 486 to 505 of Fig. 4.

Three clones, designated pUC19JC140iiia, pUC19JC140iiid and
pUC19JC140iiie, were subsequently found to contain partial Cry j II sequence. The
sequence of clone pUC19JC140iiid was chosen as the consensus sequence since it
had the longest 3' untranslated region. The sequences of pUC19JC140iiid and
pUC19JC137b were used to construct the composite Cry j II sequence shown in Fig.
4. In this composite, nucleotide 230 is reported as the A found in pUC19JC137b
(also pUC19JC137a, pUC19JC140iiia and pUC19JC140iiie), not as the G found in
pUC19JC140iiid; however, both A and G at nucleotide 230 encode Lys at amino acid
63. The sequence of clone pUC19JC140iiia was identical to that of pUC19JC140iiid
except for the following: pUC19JC140iiia has a T at nucleotide 357 in place of a C
(no predicted change in amino acid 106), a C at nucleotide 754 instead of T
(changes amino acid 238 from Ile to Thr), a C at nucleotide 1246 instead of T
(changes amino acid 402 from Leu to Pro), and a T at nucleotide 1672 instead of C
(untranslated region). The sequence of clone pUC19JC140iiie was identical to that
of pUC19JC140iiid except for G at nucleotide 794 instead of A (changes amino acid
251 from Ile to Met), and T at nucleotide 357 in place of C (no predicted change in
amino acid 106).
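
The pairing of nucleotide positions with amino acid numbers in this paragraph follows from the reading frame. The sketch below is an illustration under two stated assumptions: the initiating Met codon begins at nucleotide 42 of Fig. 4 (as given later in this Example), and amino acid 1 is that Met. The function name `codon_position` is hypothetical.

```python
ORF_START = 42  # assumed: first nucleotide of the initiating Met codon (Fig. 4)


def codon_position(nucleotide):
    """Map a cDNA nucleotide number to (amino acid number, position in codon).

    Both values are 1-based; position in codon is 1, 2, or 3.
    """
    offset = nucleotide - ORF_START
    if offset < 0:
        raise ValueError("nucleotide lies in the 5' untranslated region")
    return offset // 3 + 1, offset % 3 + 1


# Nucleotide positions discussed in the text.
for nt in (230, 357, 754, 794, 1246):
    residue, position = codon_position(nt)
    print(f"nucleotide {nt}: amino acid {residue}, codon position {position}")
```

Under these assumptions, nucleotide 230 falls at the third position of codon 63 and nucleotide 357 at the first position of codon 106, which is consistent with both substitutions being silent (Lys and Leu each have synonymous codons differing at those positions), whereas nucleotides 754 and 1246 fall at second positions, where a change always alters the residue.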

An earlier attempt at cloning the JC140iii PCR product using an Eco RI/Xba
I digest (oligonucleotide AP has both Xba I and Asp 718 restriction enzyme sites)
yielded cDNA that was cut in half due to an internal Xba I restriction site in the Cry
j II cDNA, giving rise to 800 and 750 bp bands; the 750 bp band was successfully
cloned into Eco RI/Xba I digested pUC19 and sequenced. Two 750 bp clones were
sequenced and found to be the 5' half of the Cry j II molecule: clones
pUC19JC140-2a and pUC19JC140-2b. Clone pUC19JC140-2a has C for nucleotide 297
instead of T (changes amino acid 86 from Cys to Arg) and clone pUC19JC140-2b has G
for nucleotide 753 instead of A (changes amino acid 238 from Ile to Val). Both clone
pUC19JC140-2a and clone pUC19JC140-2b have a T at nucleotide 357 in place of C
(no predicted change in amino acid 106).

Two different PCR amplifications were also sequenced directly to verify the
clonal Cry j II sequence using the Amplitaq Cycle Sequencing kit (Perkin Elmer
Cetus, Norwalk, CT). This procedure involves the [32P]-end-labelling of
oligonucleotide sequencing primers which are then annealed (1.6 pmoles in 1 µl) to
template DNA and elongated with dideoxy NTPs (methodology of Sanger et al.
(1977) Proc. Natl. Acad. Sci. USA 74:5463-5476) in a PCR reaction also containing
4 µl 10X Cycling Mix (contains 0.5 U/µl Amplitaq DNA Polymerase), 5 µl template
DNA (10-100 fmoles) and dH2O to 20 µl. The dGTP in the termination mixes in
this kit has been replaced by 7-deaza-dGTP, which provides increased resolution of
sequences containing high G+C regions of DNA. The template DNA was a PCR
product that was recovered by sequential chloroform, phenol, and chloroform
extractions, precipitated at -20°C with 0.5 volumes of 7.5 M ammonium acetate and
1.5 volumes of isopropanol, then electrophoresed through a preparative 1 or 2%
SeaPlaque low melt gel (FMC). Appropriate sized DNA bands were visualized by
EtBr staining, excised, and treated with Gelase (Epicentre Technologies, Madison,
WI) to remove the agarose. The DNA was again precipitated, and resuspended in
50 µl TE (10 mM Tris, pH 7.4, 1 mM EDTA, pH 8.0) containing 20 µg/ml RNase
(Boehringer Mannheim, Indianapolis, IN). Two secondary amplifications which had
been used to clone Cry j II were repeated, and used as template DNA for PCR cycle
sequencing: JC137ii, the 5' end PCR (amplified from the 1° PCR JC136 above),
was reamplified with oligonucleotides AP and CP-36; and JC140ii, the 3' end PCR
(amplified from the 1° PCR JC138ii above), was reamplified with oligonucleotides
AP and CP-38. Both of the 1° amplifications used were precipitated,
electrophoresed through a preparative 1 or 2% SeaPlaque low melt gel (FMC), and
the appropriate sized bands were visualized by EtBr staining and excised. Two µl of
each 1° amplification was then used in the corresponding 2° PCR reaction. The 2°
PCR product was then prepared as DNA template for PCR cycle sequencing as
described above. The oligonucleotides used as primers in PCR cycle sequencing,
many of which were used to sequence the clones, are as follows: for JC137ii, CP-36
and CP-39 (SEQ ID NO: 44), which has the sequence
5'-CTGTCCAACATAATTTGGGC-3' and is the non-coding strand sequence
corresponding to nucleotides 120 to 139 of Fig. 4. The oligonucleotide primers used
for sequencing JC140ii were CP-38, CP-40, CP-41, CP-42, CP-43, CP-44, CP-45,
CP-46, CP-47, CP-49, CP-50, CP-54 (SEQ ID NO: 45), which has the sequence
5'-CATGGCAGGGTGGTTCAGGC-3' and corresponds to nucleotides 985 to 1004 of Fig.
4; CP-55 (SEQ ID NO: 46), which has the sequence
5'-TAGCCCCATTTACGTGCACG-3' and is the non-coding strand sequence that
corresponds to nucleotides 929 to 948 of Fig. 4; and CP-56 (SEQ ID NO: 47),
which has the sequence 5'-TTGGGGTCGAGGCCTCCGAA-3' and corresponds to
nucleotides 1437 to 1456 of Fig. 4. The sequence obtained by this full-length PCR
cycle sequencing had only 2 nucleotide changes from the composite
pUC19JC137b/pUC19JC140iiid Cry j II sequence shown in Figure 4, neither of
which leads to an amino acid change. There was a T instead of C at nucleotide 357
(no predicted change in amino acid 106), and a C instead of A at nucleotide 635 (no
amino acid change).

The nucleotide and predicted amino acid sequences of Cry j II are shown in
Figs. 4 and 5. This is a composite nucleotide sequence from the two overlapping
clones pUC19JC137b and pUC19JC140iiid. Sequencing of multiple independent
clones and cycle sequencing of PCR product confirmed the nucleotide sequence of
Figure 4. There were several nucleotide changes resulting in predicted amino acid
changes, as cited above. However, all nucleotide polymorphisms, with the
exception of the T for C substitution at nucleotide 357, were only observed in single
clones or sequencing reactions. Although T was seen at nucleotide 357 in all clones
except pUC19JC140iiid, both C and T encode Leu at amino acid 106.

The complete cDNA sequence for Cry j II is composed of 1726 nucleotides,
including 41 nucleotides of 5' untranslated sequence, an open reading frame of 1542
nucleotides starting with the codon for an initiating Met (nucleotides 42-44 of Fig.
4), and a 143 bp 3' untranslated region. There is a consensus polyadenylation signal
sequence in the 3' untranslated region 64 nucleotides 5' to the poly A tail
(nucleotides 1654-1659 of Fig. 4). The position of the initiating Met is confirmed
by the presence of an in-frame upstream stop codon and by 78% homology with the
plant consensus sequence that encompasses the initiating Met (TAAAAUGGC (bases
38 through 46 of Fig. 4) (SEQ ID NO: 48) found in Cry j II compared with the
AACAAUGGC (SEQ ID NO: 49) consensus sequence for plants; Lutcke et al.
(1987) EMBO J. 6: 43-48). The open reading frame encodes a deduced protein of
514 amino acids that has complete sequence identity with the published partial
protein sequence for Cry j II (Sakaguchi et al. supra), which corresponds to amino
acids 55 through 64 of Fig. 4. The predicted Cry j II protein has 20 Cys, contains
four potential N-linked glycosylation sites corresponding to the consensus sequence
N-X-S/T, has a predicted molecular weight of 56.6 kDa and a predicted pI of 9.08.
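
The 78% figure for the sequence context of the initiating Met can be reproduced as a simple position-by-position identity over nine bases. The sketch below is illustrative and rests on assumptions: the Cry j II context is taken as TAAAAUGGC (bases 38 through 46, consistent with primer CP-52, whose gene-specific portion begins ATGGCC at nucleotide 42), and the plant consensus is taken as AACAAUGGC. The function name is hypothetical.

```python
def percent_identity(a, b):
    """Percentage of aligned positions at which two equal-length sequences match."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


CRY_J_II_CONTEXT = "TAAAAUGGC"   # assumed reading of bases 38 through 46 of Fig. 4
PLANT_CONSENSUS = "AACAAUGGC"    # assumed reading of the plant consensus

identity = percent_identity(CRY_J_II_CONTEXT, PLANT_CONSENSUS)
print(f"{identity:.1f}% identity")
```

Seven of the nine positions match under these readings, which rounds to the 78% quoted in the text.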

Detection of three separate NH2 termini sequences for Cry j II (the long form
and the short form as determined in Example 2 and the NH2 terminus determined by
Sakaguchi et al., supra, as shown in Fig. 6) may suggest that the amino terminus of
the mature Cry j II protein is blocked and that the sequences obtained by sequence
analysis of purified protein represent proteolytic cleavage products. As shown in
Fig. 6, the amino acid sequence of the long form of Cry j II begins at amino acid 46
and the amino acid sequence of the short form of Cry j II begins at amino acid 51;
and the NH2-terminal sequence determined by Sakaguchi et al. begins at amino acid
54. It is also possible that amino acids 1 to 45 represent the leader/pre-pro portion
of Cry j II that is enzymatically cleaved to give a functionally active protein
beginning at amino acid 46 of Fig. 4. In that case, the sequences beginning at amino
acids 51 and 54 would represent breakdown products of the protein beginning at
amino acid 46. There is a predicted cleavage site between amino acids 22 and 23 of
Fig. 4 using the method of von Heijne (Nucleic Acids Res. (1986) 14:4683-4690). If
the mature Cry j II protein started at amino acid 23 in Fig. 4, the protein would be
492 amino acids long with a predicted molecular weight of 54.2 kDa and a predicted
pI of 9.0.
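
The length figures quoted in this Example are mutually consistent, which can be confirmed with a few lines of arithmetic. The sketch below is only a bookkeeping check using the numbers stated in the text (41 nt 5' untranslated, 1542 nt open reading frame, 143 nt 3' untranslated, and candidate start residues 23, 46, 51 and 54); the names are hypothetical, and it assumes the 1542 nt reading frame excludes the stop codon.

```python
FIVE_PRIME_UTR = 41    # nucleotides, from the text
ORF_LENGTH = 1542      # nucleotides; assumed to exclude the stop codon
THREE_PRIME_UTR = 143  # nucleotides, from the text

total_length = FIVE_PRIME_UTR + ORF_LENGTH + THREE_PRIME_UTR
orf_start = FIVE_PRIME_UTR + 1            # first base of the Met codon
orf_end = FIVE_PRIME_UTR + ORF_LENGTH     # last base of the last codon
residues = ORF_LENGTH // 3                # length of the deduced protein


def length_from(start_residue):
    """Length of a protein that begins at the given residue of the precursor."""
    if not 1 <= start_residue <= residues:
        raise ValueError("start residue outside the deduced protein")
    return residues - start_residue + 1


print(total_length, orf_start, orf_end, residues)
for start in (23, 46, 51, 54):
    print(f"starting at amino acid {start}: {length_from(start)} residues")
```

The totals reproduce the 1726 nt cDNA, the Met codon at nucleotides 42-44, the 514-residue precursor, and the 492-residue protein predicted for cleavage between amino acids 22 and 23.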

Searching the Swiss-Prot data base with the Cry j II sequence demonstrated
that Cry j II is 43.3% homologous (33.3% identical) to polygalacturonase of tomato
(Lycopersicon esculentum) and 48.4% homologous (32.6% identical) to
polygalacturonase of corn, Zea mays. All nucleotide and amino acid sequence
analyses were performed using PCGENE (Intelligenetics, Mountain View, CA).

Example 4

Extraction of RNA from Japanese Cedar Pollen Collected in Japan and
Expression of Recombinant Cry j II

Fresh pollen collected from a pool of Cryptomeria japonica (Japanese cedar)
trees in Japan was frozen immediately on dry ice. RNA was prepared from 500 mg
of the pollen, essentially as described by Frankis and Mascarenhas, Ann. Bot.
45:595-599. The samples were ground by mortar and pestle on dry ice and suspended
in 5 ml of 50 mM Tris pH 9.0 with 0.2 M NaCl, 1 mM EDTA, 1% SDS that had been
treated overnight with 0.1% DEPC. After five extractions with phenol/chloroform/
isoamyl alcohol (mixed at 25:24:1), the RNA was precipitated from the aqueous
phase with 0.1 volume 3 M sodium acetate and 2 volumes ethanol. The pellets were
recovered by centrifugation, resuspended in 2 ml dH2O and heated to 65°C for 5
minutes. Two ml of 4 M lithium chloride were added to the RNA preparations and
they were incubated overnight at 0°C. The RNA pellets were recovered by
centrifugation, resuspended in 1 ml dH2O, and again precipitated with 3 M sodium
acetate and ethanol overnight. The final pellets were resuspended in 100 µl dH2O
and stored at -80°C.

Double stranded cDNA was synthesized from 8 µg pollen RNA using the
cDNA Synthesis Systems kit (BRL) with oligo dT priming according to the method
of Gubler and Hoffman (1983) Gene 25:263-269. PCRs were carried out using the
GeneAmp DNA Amplification kit (Perkin Elmer Cetus) whereby 10 µl 10x buffer
containing dNTPs was mixed with 100 pmol each of a sense oligonucleotide and an
anti-sense oligonucleotide, cDNA (10 µl of a 400 µl double stranded cDNA reaction
mix), 0.5 µl Amplitaq DNA polymerase, and distilled water to 100 µl.

The samples were amplified with a programmable thermal controller from
MJ Research, Inc. (Cambridge, MA). The first 5 rounds of amplification consisted
of denaturation at 94°C for 1 min, annealing of primers to the template at 45°C for
1 min, and chain elongation at 72°C for 1 min. The final 20 rounds of amplification
consisted of denaturation as above, annealing at 55°C for 1 min, and elongation as
above.
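
The two-stage cycling program described above (a low-stringency stage followed by a higher-stringency stage) can be written down as a small data structure, which makes the cycle count and programmed hold time explicit. This is a sketch for illustration only: the class and function names are hypothetical, and ramp times between temperatures, which depend on the instrument, are not modelled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    temp_c: int
    minutes: float


@dataclass(frozen=True)
class Stage:
    cycles: int
    steps: tuple


def two_stage_program(low_anneal_c=45, high_anneal_c=55):
    """Build the program from the text: 5 cycles at the low annealing
    temperature, then 20 cycles at the high annealing temperature."""
    def cycle(anneal_c):
        return (
            Step("denature", 94, 1),
            Step("anneal", anneal_c, 1),
            Step("extend", 72, 1),
        )
    return (Stage(5, cycle(low_anneal_c)), Stage(20, cycle(high_anneal_c)))


def total_cycles(program):
    return sum(stage.cycles for stage in program)


def hold_minutes(program):
    """Programmed hold time only; temperature ramps are not included."""
    return sum(
        stage.cycles * sum(step.minutes for step in stage.steps)
        for stage in program
    )


PROGRAM = two_stage_program()
print(total_cycles(PROGRAM), hold_minutes(PROGRAM))
```

As described, the program amounts to 25 cycles with 75 minutes of programmed hold time.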

A new set of primer pairs was synthesized for amplification of a Cry j II
cDNA from the initiating Met to the stop codon. CP-52 (SEQ ID NO: 50) has the
sequence 5'-GCCGAATTCATGGCCATGAAATTAATT-3' where the nucleotide
sequence 5'-GCCGAATTC-3' (SEQ ID NO: 51) (bases 1 through 9 of CP-52)
represents an Eco RI restriction site added for cloning purposes, and the remaining
sequence corresponds to nucleotides 42 to 59 of Fig. 4. CP-53 (SEQ ID NO: 52)
has the sequence 5'-CGGGGATCCTCATTATGGATGGTAGAT-3' where the
nucleotide sequence 5'-CGGGGATCC-3' (SEQ ID NO: 53) (bases 1 through 9 of
CP-53) represents a Bam HI restriction site added for cloning purposes, and the
remaining oligonucleotide sequence of CP-53 is complementary to coding strand
sequence corresponding to nucleotides 1572 to 1589 of Fig. 4. The PCR reaction
with CP-52 and CP-53 on the double stranded Japanese Cedar pollen cDNA yielded
a band of approximately 1.55 kb on an EtBr-stained agarose minigel, and was called
JC145. Amplified DNA was recovered by sequential chloroform, phenol, and
chloroform extractions, followed by precipitation at -20°C with 0.5 volumes of 7.5
M ammonium acetate and 1.5 volumes of isopropanol. After precipitation and
washing with 70% ethanol, the DNA was simultaneously digested with Eco RI and
Bam HI in a 15 µl reaction, and electrophoresed through a preparative 1% SeaPlaque
low melt gel (FMC). Appropriate sized DNA bands were visualized by EtBr staining,
excised, and ligated into appropriately digested pUC19 for sequencing by the
dideoxy chain termination method (Sanger et al. (1977) Proc. Natl. Acad. Sci. USA
74:5463-5476) using a commercially available sequencing kit (Sequenase kit, U.S.
Biochemicals, Cleveland, OH).

Clones pUC19JC145a and pUC19JC145b were completely sequenced using
M13 forward and reverse primers (N.E. Biolabs, Beverly, MA) and internal
sequencing primers CP-41, CP-42, CP-44, CP-46, and CP-51. The nucleotide and
deduced amino acid sequences of clones pUC19JC145a and pUC19JC145b were
identical to the Cry j II sequence of Fig. 4, with the following exceptions. Clone
pUC19JC145a was found to contain a single nucleotide difference from the
previously known Cry j II sequence: it has a C at nucleotide position 1234 of Fig. 4
rather than the previously described T. This nucleotide change results in a predicted
amino acid change from Ile to Thr at amino acid 398 of the Cry j II protein. Clone
pUC19JC145b has a G at nucleotide position 1088 of Fig. 4 rather than the
previously described A, and an A for a G at nucleotide 1339. The nucleotide change
at 1088 is silent and does not result in a predicted amino acid change. The
nucleotide change at position 1339 results in a predicted amino acid change from Ser
to Asn at amino acid 433 of the Cry j II protein. None of these polymorphisms has
yet been confirmed by independently-derived PCR clones or by direct amino acid
sequencing, and they may be due to the inherent error rate of Taq polymerase
(approximately 2 x 10⁻⁴, Saiki et al. (1988) Science 239:487-491). However, such
polymorphisms in primary nucleotide and amino acid sequences are expected.

Expression of Cry j II was performed as follows. Ten µg of pUC19JC145b
was digested simultaneously with Eco RI and Bam HI. The nucleotide insert
encoding Cry j II (extending from nucleotide 42 through 1589 of Fig. 4) was
isolated by electrophoresis of this digest through a 1% SeaPlaque low melt agarose
gel. The insert was then ligated into the appropriately digested expression vector
pET-11d (Novagen, Madison, WI; Jameel et al. (1990) J. Virol. 64:3963-3966)
modified to contain a sequence encoding 6 histidines (His6) immediately 3' of the
ATG initiation codon followed by a unique Eco RI endonuclease restriction site. A
second Eco RI endonuclease restriction site in the vector, along with neighboring Cla
I and Hind III endonuclease restriction sites, had previously been removed by
digestion with Eco RI and Hind III, blunting and religation. The histidine (His6)
sequence was added for affinity purification of the recombinant protein (Cry j II)
on a Ni2+ chelating column (Hochuli et al. (1987) J. Chromatog. 411:177-184;
Hochuli et al. (1988) Bio/Tech. 6:1321-1325). A recombinant clone was used to
transform Escherichia coli strain BL21-DE3, which harbors a plasmid that has an
isopropyl-β-D-thiogalactopyranoside (IPTG)-inducible promoter preceding the gene
encoding T7 polymerase. Induction with IPTG leads to high levels of T7 polymerase
expression, which is necessary for expression of the recombinant protein in
pET-11d. Clone pET-11dAHRhis6JC145b.a was confirmed to be a Cry j II clone in
the correct reading frame for expression by dideoxy sequencing (Sanger et al.
supra) with CP-39.

Expression of the recombinant protein was examined in an initial small
culture. An overnight culture of clone pET-11dAHRhis6JC145b.a was used to
inoculate 50 ml of media (Brain Heart Infusion Media, Difco) containing ampicillin
(200 µg/ml), grown to an A600 = 1.0 and then induced with IPTG (1 mM, final
concentration) for 2 hrs. One ml aliquots of the bacteria were collected before and
after induction, pelleted by centrifugation, and crude cell lysates prepared by boiling
the pellets for 5 minutes in 50 mM Tris HCl, pH 6.8, 2 mM EDTA, 1% SDS, 1%
β-mercaptoethanol, 10% glycerol, 0.25% bromophenol blue (Studier et al. (1990)
Methods in Enzymology 185:60-89). Recombinant protein expression was examined
on a 12% Coomassie blue-stained SDS-PAGE gel, according to the method in
Sambrook et al., supra, on which 25 µl of the crude lysates were loaded. A negative
control consisted of crude lysate from uninduced bacteria containing the plasmid
with Cry j II. There was no notable increase in production of any recombinant E.
coli protein in the range of 58 kD, the size predicted for the recombinant Cry j II
with the His6 leader.

The pET-11dAHRhis6JC145b.a clone was then grown on a larger scale to
examine if there was any recombinant protein being expressed. A 2 ml culture of
bacteria containing the recombinant plasmid was grown for 8 hr, then 3 µl was
spread onto each of 6 (100 x 15 mm) petri plates with 1.5% agarose in LB medium
(Gibco-BRL, Gaithersburg, MD) containing 200 µg/ml ampicillin, grown to
confluence overnight, then scraped into 6 L of liquid media (Brain Heart Infusion
media, Difco) containing ampicillin (200 µg/ml). The culture was grown until the
absorbance at A600 was 1.0, IPTG was added (1 mM final concentration), and the
culture was grown for an additional 2 hours.

Bacteria were recovered by centrifugation (7,930 x g, 10 min) and lysed in 50
ml of 6 M Guanidine-HCl, 0.1 M Na2HPO4, pH 8.0, for 1 hour with vigorous
shaking. Insoluble material was removed by centrifugation (11,000 x g, 10 min,
4°C). The pH of the lysate was adjusted to pH 8.0, and the lysate applied to a 50 ml
Nickel NTA agarose column (Qiagen) that had been equilibrated with 6 M
Guanidine HCl, 100 mM Na2HPO4, pH 8.0. The column was sequentially washed
with 6 M Guanidine HCl, 100 mM Na2HPO4, 10 mM Tris-HCl, pH 8.0, then 8 M
urea, 100 mM Na2HPO4, pH 8.0, and finally 8 M urea, 100 mM sodium acetate,
10 mM Tris-HCl, pH 6.3. The column was washed with each buffer until the flow
through had an A280 < 0.05.

The recombinant Cry j II protein was eluted with 8 M urea, 100 mM sodium
acetate, 10 mM Tris-HCl, pH 4.5, and collected in 10 ml aliquots. The protein
concentration of each fraction was determined by A280 and the peak fractions
pooled. An aliquot of the collected recombinant protein was analyzed on SDS-PAGE
according to the method in Sambrook et al., supra.

This 6 L prep, JCIIpET-1, yielded 1.5 mg of recombinant Cry j II, which was
resolved into 2 major bands on SDS-PAGE at 58 kDa and 24 kDa. The 58 kDa
band, which represents recombinant Cry j II, was approximately 9-10% of the total
protein as determined by densitometry measurement (Shimadzu Flying Spot Scanner,
Shimadzu Scientific Instruments, Inc., Braintree, MA). The 24 kDa band accounts
for about 90% of the total protein and may represent a degradation product of the
recombinant Cry j II or an E. coli contaminant.

Another Cry j II expression construct was made by the ligation of the
pUC19JC140iiid Cry j II insert into appropriately digested pET11dAHR (with the 6
histidine leader). The vector was derived from another pET11dAHR construct
whose insert supplied an Eco RI site (at the 5' pET11dAHR-insert junction) and an
Asp 718 site (at the 3' end of the insert); the construct was digested with these two
enzymes, run on a low melt minigel as above, and the vector recovered as a band in
low melt agarose. The pUC19JC140iiid construct was digested with Eco RI and
Asp 718 to release the Cry j II insert, which was isolated on a low melt minigel and
ligated into the Eco RI/Asp 718 digested pET11dAHR vector prepared above. Five
clones were found to contain the correct nucleotide sequence at the insert/vector 5'
junction, when sequenced by dideoxy sequencing (as above) with CP-39. This new
construct, when expressed, would begin at amino acid 46 of Cry j II as shown in
Figs. 4 and 5. This recombinant protein is designated rCry j II A46. A 50 ml small
scale expression test (as performed above) showed that the expression level of rCry
j II A46 from this construct, designated pET11dAHRJC140iiid2, would be much
greater than the initial expression level from pET11dAHRJC145b2. A 9 L prep,
JCIIpET-3, was processed as above, and yielded 200 mg of rCry j II A46 at 80%
purity as determined by densitometry of a Coomassie blue stained 12% SDS-PAGE
gel.
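
The difference between the two preparations is easier to compare on a per-litre basis. The following sketch uses only the figures stated above (1.5 mg from 6 L for the first prep; 200 mg at 80% purity from 9 L for the second). The prep labels and function names are illustrative, and the first prep is left uncorrected for purity because the text does not state unambiguously whether the 1.5 mg refers to total recovered protein or to the 58 kDa species.

```python
def per_litre(yield_mg, culture_l):
    """Recovered protein per litre of culture, in mg/L."""
    if culture_l <= 0:
        raise ValueError("culture volume must be positive")
    return yield_mg / culture_l


def target_mg(total_mg, purity_fraction):
    """Mass of the target species given total mass and fractional purity."""
    if not 0 <= purity_fraction <= 1:
        raise ValueError("purity must be a fraction between 0 and 1")
    return total_mg * purity_fraction


# Figures as stated in the text; labels are illustrative.
first_prep = per_litre(1.5, 6.0)        # full-length His6 construct
second_prep = per_litre(200.0, 9.0)     # construct beginning at amino acid 46
second_prep_target = target_mg(200.0, 0.80)

print(f"first prep:  {first_prep:.2f} mg/L recovered")
print(f"second prep: {second_prep:.1f} mg/L recovered")
print(f"second prep: {second_prep_target:.0f} mg of target at 80% purity")
print(f"ratio of recovered protein per litre: {second_prep / first_prep:.0f}-fold")
```

On these figures the second construct recovers roughly two orders of magnitude more protein per litre of culture than the first.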

Example 5 

Northern blot on RNA from Japanese Cedar Pollen Sources 

A northern blot analysis was performed on the RNA isolated from Japanese 
cedar pollen from both the Arnold Arboretum tree and the pooled trees from Japan. 
Using essentially the method of Sambrook, supra, ten μg of RNA isolated from 
Japanese cedar pollen collected from the Arnold Arboretum (Boston, MA) and 15 μg 
pooled RNA from Japanese cedar pollen collected from trees in Japan were run on a 
1.2% agarose gel containing 38% formaldehyde and 1X MOPS (20X = 0.4M 
MOPS, 0.02M EDTA, 0.1M NaOAc, pH 7.0) solution. The RNA samples (first 
precipitated with 1/10 volume sodium acetate, 2 volumes ethanol to reduce volume 
and resuspended in 5.5 μl dH2O) were run with 10 μl formaldehyde/formamide 
buffer containing loading dyes with 15.5% formaldehyde, 42% formamide, and 
1.3X MOPS solution, final concentration. The samples were transferred to 
Genescreen Plus (NEN Research Products, Boston, MA) by capillary transfer in 10X 
SSC (20X = 3M NaCl, 0.3M Sodium Citrate), after which the membrane was 
baked 2 hrs at 80°C and UV irradiated for 3 minutes. Prehybridization of the 
membrane was at 60°C for 1 hour in 4 ml 0.5M NaPO4 (pH 7.2), 1mM EDTA, 1% 
BSA, and 7% SDS. The antisense probe was synthesized by asymmetric PCR on the 
JC145 amplification in low melt agarose (above), where 2 μl DNA is amplified with 
2 μl dNTP mix (0.167mM dATP, 0.167mM dTTP, 0.167mM dGTP, and 0.033mM 
dCTP), 2 μl 10X PCR buffer, 10 μl 32P-dCTP (100 μCi; Amersham, Arlington 
Heights, IL), 1 μl (100 pmoles) antisense primer CP-53, 0.5 μl Taq polymerase, and 
dH2O to 20 μl; the 10X PCR buffer, dNTPs and Taq polymerase were from Perkin 
Elmer Cetus (Norwalk, CT). Amplification consisted of 30 rounds of denaturation 
at 94°C for 45 sec, annealing of primer to the template at 60°C for 45 sec, and chain 
elongation at 72°C for 1 min. The reaction was stopped by addition of 100 μl TE, 
and the probe recovered over a 3cc G-50 spin column (2 ml G-50 Sephadex 
[Pharmacia, Uppsala, Sweden] in a 3cc syringe plugged with glass wool, 
equilibrated with TE) and counted on a 1500 TriCarb Liquid Scintillation Counter 
(Packard, Downers Grove, IL). The probe was added to the prehybridizing buffer at 
10^ cpm/ml and hybridization was carried out at 60°C for 16 hrs. The blot was 
washed in high stringency conditions: 3x15 min at 65°C with 0.2X SSC/1% SDS, 
followed by wrapping in plastic wrap and exposure to film at -80°C. A seven hour 
exposure of this Northern blot analysis revealed a single thick band at approximately 
1.7 kb for both the RNA collected from the Arboretum tree and the RNA collected from 
the pooled trees from Japan. This message is the expected size for Cry j II as 
predicted by PCR analysis of the cDNA. 
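
The working-strength buffers used above follow from the stated 20X stock compositions by simple dilution. The following Python sketch only illustrates that arithmetic; the function names are hypothetical and the 100 ml volume in the usage note is an arbitrary example, not a value from this Example.

```python
import math

# Stock compositions (mol/l) as stated in the text.
MOPS_20X = {"MOPS": 0.4, "EDTA": 0.02, "NaOAc": 0.1}
SSC_20X = {"NaCl": 3.0, "sodium citrate": 0.3}


def dilute(stock, stock_fold, working_fold):
    """Component molarities after diluting a stock_fold-X stock to working_fold-X."""
    factor = working_fold / stock_fold
    return {name: conc * factor for name, conc in stock.items()}


def stock_volume(final_volume, stock_fold, working_fold):
    """C1*V1 = C2*V2: volume of stock needed for the given final volume."""
    return final_volume * working_fold / stock_fold


mops_1x = dilute(MOPS_20X, 20, 1)   # gel running buffer
ssc_10x = dilute(SSC_20X, 20, 10)   # capillary transfer buffer
```

Under these figures, 1X MOPS corresponds to about 20 mM MOPS, 1 mM EDTA and 5 mM NaOAc, and 10X SSC to 1.5 M NaCl and 0.15 M sodium citrate; `stock_volume(100, 20, 1)` gives the 5 ml of 20X stock needed for 100 ml of 1X buffer.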

Example 6 

Direct binding assay of IgE to Cry j I, Cry j II and recombinant Cry j II. 

Corning assay plates (#25882-96) were coated with Cry j I or Cry j II at 1 
μg/mL or recombinant Cry j II preparation at 10 μg/mL (approximately 20% pure) 
in a volume of 50 μL overnight at 4°C. The coating antigens were removed and the 
wells were blocked with 0.5% gelatin, PVP (polyvinylpyrrolidone) 1 mg/mL in 
PBS, 200 μL/well for 2 hours at room temperature. The anti-Cry j I monoclonal 
antibody, 4B11, was serially diluted in PBS-Tween 20 starting at a 1:1000 dilution. 
The human plasma were serially diluted in PBS-Tween at a starting dilution of 1:2. 
For this set, 23 plasma samples from patients symptomatic for Japanese cedar pollen 
allergy were chosen for IgE binding analysis. The first antibody incubation proceeded 
overnight at 4°C. Following three washes with PBS-Tween, the second antibodies 
were added (goat anti-mouse Ig or goat anti-human IgE, both at 1:2000) and 
incubated for two hours at room temperature at 100 μL/well. This solution was 
removed and streptavidin-HRPO, diluted to 1:10,000, was added at 100 μL/well. The 
color was allowed to develop for 2-5 minutes. The reaction was stopped by the 
addition of 100 μL/well of 1M phosphoric acid. Plates were read on a Microplate 
EL310 Autoreader (Biotek Instruments, Winooski, VT) with a 450nm filter. The 
absorbance levels of duplicate wells were averaged. The graphed results (log of the 
dilution vs. absorbance) of the ELISA assays are shown in Figs. 7 to 15. A 
summary of the results is given in Fig. 16. A positive binding result, indicated by 
a plus sign, is determined to be a reading of two-fold or greater above background 
(no first antibody) at the second dilution of plasma (1:6). 
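
The scoring rule just described (duplicate wells averaged, positive when the reading at the second plasma dilution is at least two-fold above the no-first-antibody background) can be sketched as follows. This is a minimal illustration, not the analysis actually used; the function names are hypothetical, the three-fold dilution step is inferred from the 1:2 start and the 1:6 second dilution, and the absorbance values in the usage note are invented.

```python
def dilution_series(start=2, fold=3, steps=4):
    """Reciprocal plasma dilutions: 1:2, 1:6, 1:18, ... (three-fold step inferred)."""
    return [start * fold**i for i in range(steps)]


def mean_of_wells(wells):
    """Average the absorbance of replicate (here duplicate) wells."""
    return sum(wells) / len(wells)


def is_positive(second_dilution_wells, background_wells, threshold=2.0):
    """Plus sign if mean A450 at the second dilution >= threshold x mean background."""
    signal = mean_of_wells(second_dilution_wells)
    background = mean_of_wells(background_wells)
    return signal >= threshold * background
```

For example, with hypothetical duplicate readings of 0.42 and 0.46 at the 1:6 dilution against background wells of 0.10 and 0.12, `is_positive([0.42, 0.46], [0.10, 0.12])` evaluates to `True`.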

Fig. 7 shows the binding response of the monoclonal antibody, 4B11, and of seven 
patients' (Batch 1) plasma IgE to purified Cry j I as the coating antigen. 

Example 7 

Synthesis of Cry j II Peptides 

Japanese cedar pollen Cry j II peptides designated Cry j IIA and Cry j IIB were 
synthesized using standard Fmoc/tBoc synthetic chemistry and purified by Reverse 
Phase HPLC. The amino acid sequence of peptide Cry j IIA is FTFKVDGIIAAYQ 
(SEQ ID NO: 54), which corresponds to amino acids 116-128 as shown in Figs. 4 and 
5. The amino acid sequence of peptide Cry j IIB is NGYFSGHVIPACKN (SEQ ID 
NO: 55), which corresponds to amino acids 416-429 as shown in Figs. 4 and 5. The 
peptide names are consistent throughout. 
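
The stated residue ranges can be checked against the deduced protein sequence (SEQ ID NO: 2). The sketch below uses only two fragments of that sequence, transcribed into one-letter code and keyed by the number of their first residue; the helper name `residues` is hypothetical.

```python
# Fragments of SEQ ID NO: 2 (one-letter code), keyed by the 1-based
# position of their first amino acid.
FRAGMENTS = {
    113: "QPHFTFKVDGIIAAYQ",                  # residues 113-128
    401: "KLTSGKIASCLNDNANGYFSGHVIPACKNLSP",  # residues 401-432
}


def residues(first, last):
    """Return residues first..last (1-based, inclusive) from a covering fragment."""
    for start, fragment in FRAGMENTS.items():
        end = start + len(fragment) - 1
        if start <= first and last <= end:
            return fragment[first - start:last - start + 1]
    raise ValueError(f"residues {first}-{last} are not covered by a fragment")


cry_j_iia = residues(116, 128)  # 13 residues
cry_j_iib = residues(416, 429)  # 14 residues
```

Both slices reproduce the peptide sequences given above, which supports the residue numbering quoted for Cry j IIA and Cry j IIB.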

T Cell Responses to Japanese Cedar Pollen Antigen Peptides 

Peripheral blood mononuclear cells (PBMC) were purified by lymphocyte 
separation medium (LSM) centrifugation of 60 ml of heparinized blood from one 
Japanese cedar pollen-allergic patient who exhibited clinical symptoms of seasonal 
rhinitis and was MAST and/or skin test positive for Japanese cedar pollen. Long 
term T cell lines were established by stimulation of 2 X 10^ PBL/ml in bulk cultures 
of complete medium (RPMI-1640, 2 mM L-glutamine, 100 U/ml 
penicillin/streptomycin, 5x10^-5 M 2-mercaptoethanol, and 10 mM HEPES 
supplemented with 5% heat inactivated human AB serum) with 10 μg/ml of partially 
purified native Cry j II for 7 days at 37°C in a humidified 5% CO2 incubator to 
select for Cry j II reactive T cells. This amount of priming antigen was determined 
to be optimal for the activation of T cells from most Japanese cedar pollen allergic 
patients. Viable cells were purified by LSM centrifugation and cultured in complete 
medium supplemented with 5 units recombinant human IL-2/ml and 5 units 
recombinant human IL-4/ml for up to three weeks until the cells no longer responded 
to lymphokines and were considered "rested". The ability of the T cells to 
proliferate to peptides Cry j IIA and Cry j IIB, recombinant Cry j II (rCry j II), 
purified native Cry j II, or purified native Cry j I was then assessed. For assay, 2 X 
10^ rested cells were restimulated in the presence of 2 X 10^ autologous Epstein- 
Barr virus (EBV)-transformed B cells (prepared as described below) (gamma- 
irradiated with 25,000 RADS) with 2-50 μg/ml of rCry j II, purified native Cry j 
II, peptides Cry j IIA and Cry j IIB, or purified native Cry j I, in a volume of 200 μl 
complete medium in duplicate or triplicate wells in 96-well round bottom plates for 
2-4 days. The optimal incubation was found to be 3 days. Each well then received 
1 μCi tritiated thymidine for 16-20 hours. The counts incorporated were collected 
onto glass fiber filter mats and processed for liquid scintillation counting. The 
maximum response in a titration of each peptide is expressed as the stimulation index 
(S.I.). The S.I. is the counts per minute (CPM) incorporated by cells in response to 
peptide, divided by the CPM incorporated by cells in medium only. 

WO 94/11512 
PCT/US93/11000 

An S.I. value equal to or greater than 2 times the background level is considered 
"positive" and indicates that the peptide contains a T cell epitope. The results of this 
assay indicated that peptides Cry j IIA and Cry j IIB did not appear to contain a T cell 
epitope for this particular allergic patient. However, additional Japanese cedar 
pollen allergic patients will be tested in this assay system and one or both of these 
peptides may contain T cell epitopes for other allergic individuals. 
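
The stimulation index defined above is a simple ratio, and the reported value is the maximum over the antigen titration. The following Python sketch illustrates the calculation under stated assumptions: the function names are hypothetical and the CPM values are invented for illustration, not data from this assay.

```python
from statistics import mean


def stimulation_index(cpm_with_antigen, cpm_medium_only):
    """S.I. = mean CPM with antigen / mean CPM with medium only."""
    return mean(cpm_with_antigen) / mean(cpm_medium_only)


def max_stimulation_index(titration, cpm_medium_only):
    """Maximum S.I. over a titration, given {dose in ug/ml: replicate CPM values}."""
    return max(stimulation_index(wells, cpm_medium_only)
               for wells in titration.values())


def is_positive(si, threshold=2.0):
    """An S.I. of at least 2 is scored as positive."""
    return si >= threshold


# Hypothetical duplicate-well counts, for illustration only.
medium = [1000, 1200]
titration = {2: [1250, 1300], 10: [2300, 2500], 50: [1900, 2100]}
best = max_stimulation_index(titration, medium)
```

With these invented counts the maximum S.I. is about 2.18 (at the 10 μg/ml dose), which would be scored as positive, whereas the 2 μg/ml wells alone (S.I. about 1.16) would not.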

Preparation of EBV-transformed B Cells for Use as Antigen Presenting Cells 

Autologous EBV-transformed cell lines were γ-irradiated with 25,000 Rad 
and used as antigen presenting cells in secondary proliferation assays and secondary 
bulk stimulations. These cell lines were also used as a control in the immuno- 
fluorescence flow cytometry analysis. These EBV-transformed cell lines were made 
by incubating 5 X 10^ PBL with 1 ml of B-59/8 Marmoset cell line (ATCC 
CRL1612, American Type Culture Collection, Rockville, MD) conditioned medium 
in the presence of 1 μg/ml phorbol 12-myristate 13-acetate (PMA) at 37°C for 60 
minutes in 12 X 75 mm polypropylene round-bottom Falcon snap cap tubes (Becton 
Dickinson Labware, Lincoln Park, NJ). These cells were then diluted to 1.25 X 10^ 
cells/ml in RPMI-1640 as described above, except supplemented with 10% heat- 
inactivated fetal bovine serum, and cultured in 200 μl aliquots in flat bottom culture 
plates until visible colonies were detected. They were then transferred to larger 
wells until the cell lines were established. 

Although the invention has been described with reference to its preferred 
embodiments, other embodiments can achieve the same results. Variations and 
modifications to the present invention will be obvious to those skilled in the art and it 
is intended to cover in the appended claims all such modifications and equivalents as 
fall within the true spirit and scope of this invention. 

SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: IMMULOGIC PHARMACEUTICAL CORPORATION 

(B) STREET: 610 Lincoln Street 

(C) CITY: Waltham 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP): 02154 

(G) TELEPHONE: (617) 466-6000 

(H) TELEFAX: (617)466-6040 

(ii) TITLE OF INVENTION: Allergenic Proteins and Peptides From 

Japanese Cedar Pollen 

(iii) NUMBER OF SEQUENCES: 55 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: ASCII (TEXT) 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Vanstone, Darlene 

(B) REGISTRATION NUMBER: 35,729 

(C) REFERENCE/DOCKET NUMBER: IPC-033PC 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 466-6000 

(B) TELEFAX: (617) 466-6040 



(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1726 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 42..1586 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

TGAGTTCGAG ACAAGTATAG AAAGAATTTT CTTTTATTAA A ATG GCC ATG AAA 
53 

Met Ala Met Lys 
1 

TTA ATT GCT CCA ATG GCC TTT CTG GCC ATG CAA TTG ATT ATA ATG GCG 
101 

Leu Ile Ala Pro Met Ala Phe Leu Ala Met Gln Leu Ile Ile Met Ala 
5 10 15 20 

GCA GCA GAA GAT CAA TCT GCC CAA ATT ATG TTG GAC AGT GTT GTC GAA 
149 

Ala Ala Glu Asp Gln Ser Ala Gln Ile Met Leu Asp Ser Val Val Glu 
25 30 35 

AAA TAT CTT AGA TCG AAT CGG AGT TTA AGA AAA GTT GAG CAT TCT CGT 
197 

Lys Tyr Leu Arg Ser Asn Arg Ser Leu Arg Lys Val Glu His Ser Arg 
40 45 50 

CAT GAT GCT ATC AAC ATC TTC AAT GTG GAA AAG TAT GGC GCA GTA GGC 
245 

His Asp Ala Ile Asn Ile Phe Asn Val Glu Lys Tyr Gly Ala Val Gly 
55 60 65 

GAT GGA AAG CAT GAT TGC ACT GAG GCA TTT TCA ACA GCA TGG CAA GCT 
293 

Asp Gly Lys His Asp Cys Thr Glu Ala Phe Ser Thr Ala Trp Gln Ala 
70 75 80 

GCA TGC AAA AAC CCA TCA GCA ATG TTG CTT GTG CCA GGC AGC AAG AAA 
341 

Ala Cys Lys Asn Pro Ser Ala Met Leu Leu Val Pro Gly Ser Lys Lys 
85 90 95 100 

TTT GTT GTA AAC AAT CTG TTC TTC AAT GGG CCA TGT CAA CCT CAC TTT 
389 

Phe Val Val Asn Asn Leu Phe Phe Asn Gly Pro Cys Gln Pro His Phe 
105 110 115 

ACT TTT AAG GTA GAT GGG ATA ATA GCT GCG TAC CAA AAT CCA GCG AGC 
437 

Thr Phe Lys Val Asp Gly Ile Ile Ala Ala Tyr Gln Asn Pro Ala Ser 
120 125 130 

TGG AAG AAT AAT AGA ATA TGG TTG CAG TTT GCT AAA CTT ACA GGT TTT 
485 

Trp Lys Asn Asn Arg Ile Trp Leu Gln Phe Ala Lys Leu Thr Gly Phe 
135 140 145 

ACT CTA ATG GGT AAA GGT GTA ATT GAT GGG CAA GGA AAA CAA TGG TGG 
533 

Thr Leu Met Gly Lys Gly Val Ile Asp Gly Gln Gly Lys Gln Trp Trp 
150 155 160 

GCT GGC CAA TGT AAA TGG GTC AAT GGA CGA GAA ATT TGC AAC GAT CGT 
581 

Ala Gly Gln Cys Lys Trp Val Asn Gly Arg Glu Ile Cys Asn Asp Arg 
165 170 175 180 

GAT AGA CCA ACA GCC ATT AAA TTC GAT TTT TCC ACG GGT CTG ATA ATC 
629 

Asp Arg Pro Thr Ala Ile Lys Phe Asp Phe Ser Thr Gly Leu Ile Ile 
185 190 195 

CAA GGA CTG AAA CTA ATG AAC AGT CCC GAA TTT CAT TTA GTT TTT GGG 
677 

Gln Gly Leu Lys Leu Met Asn Ser Pro Glu Phe His Leu Val Phe Gly 
200 205 210 

AAT TGT GAG GGA GTA AAA ATC ATC GGC ATT AGT ATT ACG GCA CCG AGA 
725 

Asn Cys Glu Gly Val Lys Ile Ile Gly Ile Ser Ile Thr Ala Pro Arg 
215 220 225 

GAC AGT CCT AAC ACT GAT GGA ATT GAT ATC TTT GCA TCT AAA AAC TTT 
773 

Asp Ser Pro Asn Thr Asp Gly Ile Asp Ile Phe Ala Ser Lys Asn Phe 
230 235 240 

CAC TTA CAA AAG AAC ACG ATA GGA ACA GGG GAT GAC TGC GTC GCT ATA 
821 

His Leu Gln Lys Asn Thr Ile Gly Thr Gly Asp Asp Cys Val Ala Ile 
245 250 255 260 

GGC ACA GGG TCT TCT AAT ATT GTG ATT GAG GAT CTG ATT TGC GGT CCA 
869 

Gly Thr Gly Ser Ser Asn Ile Val Ile Glu Asp Leu Ile Cys Gly Pro 
265 270 275 

GGC CAT GGA ATA AGT ATA GGA AGT CTT GGG AGG GAA AAC TCT AGA GCA 
917 

Gly His Gly Ile Ser Ile Gly Ser Leu Gly Arg Glu Asn Ser Arg Ala 
280 285 290 

GAG GTT TCA TAC GTG CAC GTA AAT GGG GCT AAA TTC ATA GAC ACA CAA 
965 

Glu Val Ser Tyr Val His Val Asn Gly Ala Lys Phe Ile Asp Thr Gln 
295 300 305 

AAT GGA TTA AGA ATC AAA ACA TGG CAG GGT GGT TCA GGC ATG GCA AGC 
1013 

Asn Gly Leu Arg Ile Lys Thr Trp Gln Gly Gly Ser Gly Met Ala Ser 
310 315 320 

CAT ATA ATT TAT GAG AAT GTT GAA ATG ATA AAT TCG GAG AAC CCC ATA 
1061 

His Ile Ile Tyr Glu Asn Val Glu Met Ile Asn Ser Glu Asn Pro Ile 
325 330 335 340 

TTA ATA AAT CAA TTC TAC TGC ACT TCA GCT TCT GCT TGC CAA AAC CAG 
1109 

Leu Ile Asn Gln Phe Tyr Cys Thr Ser Ala Ser Ala Cys Gln Asn Gln 
345 350 355 

AGG TCT GCG GTT CAA ATC CAA GAT GTG ACA TAC AAG AAC ATA CGT GGG 
1157 

Arg Ser Ala Val Gln Ile Gln Asp Val Thr Tyr Lys Asn Ile Arg Gly 
360 365 370 

ACA TCA GCA ACA GCA GCA GCA ATT CAA CTT AAG TGC AGT GAC AGT ATG 
1205 

Thr Ser Ala Thr Ala Ala Ala Ile Gln Leu Lys Cys Ser Asp Ser Met 
375 380 385 

CCC TGC AAA GAT ATA AAG CTA AGT GAT ATA TCT TTG AAG CTT ACC TCA 
1253 

Pro Cys Lys Asp Ile Lys Leu Ser Asp Ile Ser Leu Lys Leu Thr Ser 
390 395 400 

GGG AAA ATT GCT TCC TGC CTT AAT GAT AAT GCA AAT GGA TAT TTC AGT 
1301 

Gly Lys Ile Ala Ser Cys Leu Asn Asp Asn Ala Asn Gly Tyr Phe Ser 
405 410 415 420 

GGA CAC GTC ATC CCT GCA TGC AAG AAT TTA AGT CCA AGT GCT AAG CGA 
1349 

Gly His Val Ile Pro Ala Cys Lys Asn Leu Ser Pro Ser Ala Lys Arg 
425 430 435 

AAA GAA TCT AAA TCC CAT AAA CAC CCA AAA ACT GTA ATG GTT GAA AAT 
1397 

Lys Glu Ser Lys Ser His Lys His Pro Lys Thr Val Met Val Glu Asn 
440 445 450 

ATG CGA GCA TAT GAC AAG GGT AAC AGA ACA CGC ATA TTG TTG GGG TCG 
1445 

Met Arg Ala Tyr Asp Lys Gly Asn Arg Thr Arg Ile Leu Leu Gly Ser 
455 460 465 

AGG CCT CCG AAT TGT ACA AAC AAA TGT CAT GGT TGC AGT CCA TGT AAG 
1493 

Arg Pro Pro Asn Cys Thr Asn Lys Cys His Gly Cys Ser Pro Cys Lys 
470 475 480 

GCC AAG TTA GTT ATT GTT CAT CGT ATT ATG CCG CAG GAG TAT TAT CCT 
1541 

Ala Lys Leu Val Ile Val His Arg Ile Met Pro Gln Glu Tyr Tyr Pro 
485 490 495 500 

CAG AGG TGG ATA TGC AGC TGT CAT GGC AAA ATC TAC CAT CCA TAATGAGATA 
1593 

Gln Arg Trp Ile Cys Ser Cys His Gly Lys Ile Tyr His Pro 
505 510 

CATTGAAACT GTATGTGCTA GTGAATATTC TTGTGGTACA ATATTAGAAC TGATATTGAA 
1653 

AATAAATCAT CAATGTTTCT AAGGCATTTA TAATAGATTA TATTAATGGT TCAGCCTGGT 
1713 

GCAAAAAAAA AAA 
1726 
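
The feature table and the sequence of SEQ ID NO: 1 can be cross-checked with a little arithmetic. The sketch below is illustrative only; it assumes, as the listing suggests, that the CDS range 42..1586 includes the terminal TAA stop codon, and the variable names are hypothetical.

```python
SEQ_LENGTH = 1726              # total length of SEQ ID NO: 1 in base pairs
CDS_START, CDS_END = 42, 1586  # 1-based, inclusive, from the feature table

# First 53 nucleotides of SEQ ID NO: 1 (first line of the listing).
HEAD = "TGAGTTCGAGACAAGTATAGAAAGAATTTTCTTTTATTAAAATGGCCATGAAA"

cds_length = CDS_END - CDS_START + 1     # nucleotides in the CDS
codons = cds_length // 3                 # includes the stop codon
protein_length = codons - 1              # stop codon is not translated
five_prime_utr = CDS_START - 1           # untranslated leader
three_prime_utr = SEQ_LENGTH - CDS_END   # trailer, including the poly(A) run

start_codon = HEAD[CDS_START - 1:CDS_START + 2]  # 0-based slice of positions 42-44
```

Under that assumption the CDS spans 1545 nucleotides, that is 515 codons, which agrees with the 514 amino acids reported for SEQ ID NO: 2 plus one stop codon; the slice at positions 42-44 is the ATG initiation codon.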



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Met Lys Leu Ile Ala Pro Met Ala Phe Leu Ala Met Gln Leu 
1 5 10 15 

Ile Ile Met Ala Ala Ala Glu Asp Gln Ser Ala Gln Ile Met Leu Asp 
20 25 30 

Ser Val Val Glu Lys Tyr Leu Arg Ser Asn Arg Ser Leu Arg Lys Val 
35 40 45 

Glu His Ser Arg His Asp Ala Ile Asn Ile Phe Asn Val Glu Lys Tyr 
50 55 60 

Gly Ala Val Gly Asp Gly Lys His Asp Cys Thr Glu Ala Phe Ser Thr 
65 70 75 80 

Ala Trp Gln Ala Ala Cys Lys Asn Pro Ser Ala Met Leu Leu Val Pro 
85 90 95 

Gly Ser Lys Lys Phe Val Val Asn Asn Leu Phe Phe Asn Gly Pro Cys 
100 105 110 

Gln Pro His Phe Thr Phe Lys Val Asp Gly Ile Ile Ala Ala Tyr Gln 
115 120 125 

Asn Pro Ala Ser Trp Lys Asn Asn Arg Ile Trp Leu Gln Phe Ala Lys 
130 135 140 

Leu Thr Gly Phe Thr Leu Met Gly Lys Gly Val Ile Asp Gly Gln Gly 
145 150 155 160 

Lys Gln Trp Trp Ala Gly Gln Cys Lys Trp Val Asn Gly Arg Glu Ile 
165 170 175 

Cys Asn Asp Arg Asp Arg Pro Thr Ala Ile Lys Phe Asp Phe Ser Thr 
180 185 190 

Gly Leu Ile Ile Gln Gly Leu Lys Leu Met Asn Ser Pro Glu Phe His 
195 200 205 

Leu Val Phe Gly Asn Cys Glu Gly Val Lys Ile Ile Gly Ile Ser Ile 
210 215 220 

Thr Ala Pro Arg Asp Ser Pro Asn Thr Asp Gly Ile Asp Ile Phe Ala 
225 230 235 240 

Ser Lys Asn Phe His Leu Gln Lys Asn Thr Ile Gly Thr Gly Asp Asp 
245 250 255 

Cys Val Ala Ile Gly Thr Gly Ser Ser Asn Ile Val Ile Glu Asp Leu 
260 265 270 

Ile Cys Gly Pro Gly His Gly Ile Ser Ile Gly Ser Leu Gly Arg Glu 
275 280 285 

Asn Ser Arg Ala Glu Val Ser Tyr Val His Val Asn Gly Ala Lys Phe 
290 295 300 

Ile Asp Thr Gln Asn Gly Leu Arg Ile Lys Thr Trp Gln Gly Gly Ser 
305 310 315 320 

Gly Met Ala Ser His Ile Ile Tyr Glu Asn Val Glu Met Ile Asn Ser 
325 330 335 

Glu Asn Pro Ile Leu Ile Asn Gln Phe Tyr Cys Thr Ser Ala Ser Ala 
340 345 350 

Cys Gln Asn Gln Arg Ser Ala Val Gln Ile Gln Asp Val Thr Tyr Lys 
355 360 365 

Asn Ile Arg Gly Thr Ser Ala Thr Ala Ala Ala Ile Gln Leu Lys Cys 
370 375 380 

Ser Asp Ser Met Pro Cys Lys Asp Ile Lys Leu Ser Asp Ile Ser Leu 
385 390 395 400 

Lys Leu Thr Ser Gly Lys Ile Ala Ser Cys Leu Asn Asp Asn Ala Asn 
405 410 415 

Gly Tyr Phe Ser Gly His Val Ile Pro Ala Cys Lys Asn Leu Ser Pro 
420 425 430 

Ser Ala Lys Arg Lys Glu Ser Lys Ser His Lys His Pro Lys Thr Val 
435 440 445 

Met Val Glu Asn Met Arg Ala Tyr Asp Lys Gly Asn Arg Thr Arg Ile 
450 455 460 

Leu Leu Gly Ser Arg Pro Pro Asn Cys Thr Asn Lys Cys His Gly Cys 
465 470 475 480 

Ser Pro Cys Lys Ala Lys Leu Val Ile Val His Arg Ile Met Pro Gln 
485 490 495 

Glu Tyr Tyr Pro Gln Arg Trp Ile Cys Ser Cys His Gly Lys Ile Tyr 
500 505 510 

His Pro 



(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Arg Lys Val Glu His Ser Arg His Asp Ala Ile Asn Ile Phe Asn Val 
1 5 10 15 

Glu Lys Tyr Gly Ala Val Gly Asp Gly Lys His Asp Cys Thr Glu Ala 
20 25 30 

Phe Ser Thr Ala Trp Gln Ala Ala Cys Lys Asn Pro Ser 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Arg Lys Val Glu His Ser Arg His Asp Ala Ile Asn Ile Phe Asn Val 
1 5 10 15 

Glu Lys Tyr Gly Ala Val Gly Asp Gly Lys His Asp Cys Thr Glu Ala 
20 25 30 

Phe Ser Thr Ala Trp Gln Lys Asn Pro 
35 40 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Ser Arg His Asp Ala Ile Asn Ile Phe Asn Val Glu Lys Tyr Gly Ala 
1 5 10 15 

Val Gly Asp Gly Lys His Asp Cys Thr Glu Ala Phe Ser Thr Ala Trp 
20 25 30 

Gln Lys Asn Pro 
35 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ala Ile Asn Ile Phe Asn Val Glu Lys Tyr 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1410 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

AGAAAAGTTG AGCATTCTCG TCATGATGCT ATCAACATCT TCAATGTGGA AAAGTATGGC 
60 

GCAGTAGGCG ATGGAAAGCA TGATTGCACT GAGGCATTTT CAACAGCATG GCAAGCTGCA 
120 

TGCAAAAACC CATCAGCAAT GTTGCTTGTG CCAGGCAGCA AGAAATTTGT TGTAAACAAT 
180 

CTGTTCTTCA ATGGGCCATG TCAACCTCAC TTTACTTTTA AGGTAGATGG GATAATAGCT 
240 

GCGTACCAAA ATCCAGCGAG CTGGAAGAAT AATAGAATAT GGTTGCAGTT TGCTAAACTT 
300 

ACAGGTTTTA CTCTAATGGG TAAAGGTGTA ATTGATGGGC AAGGAAAACA ATGGTGGGCT 
360 

GGCCAATGTA AATGGGTCAA TGGACGAGAA ATTTGCAACG ATCGTGATAG ACCAACAGCC 
420 

ATTAAATTCG ATTTTTCCAC GGGTCTGATA ATCCAAGGAC TGAAACTAAT GAACAGTCCC 
480 

GAATTTCATT TAGTTTTTGG GAATTGTGAG GGAGTAAAAA TCATCGGCAT TAGTATTACG 
540 

GCACCGAGAG ACAGTCCTAA CACTGATGGA ATTGATATCT TTGCATCTAA AAACTTTCAC 
600 

TTACAAAAGA ACACGATAGG AACAGGGGAT GACTGCGTCG CTATAGGCAC AGGGTCTTCT 
660 

AATATTGTGA TTGAGGATCT GATTTGCGGT CCAGGCCATG GAATAAGTAT AGGAAGTCTT 
720 

GGGAGGGAAA ACTCTAGAGC AGAGGTTTCA TACGTGCACG TAAATGGGGC TAAATTCATA 
780 

GACACACAAA ATGGATTAAG AATCAAAACA TGGCAGGGTG GTTCAGGCAT GGCAAGCCAT 
840 

ATAATTTATG AGAATGTTGA AATGATAAAT TCGGAGAACC CCATATTAAT AAATCAATTC 
900 

TACTGCACTT CAGCTTCTGC TTGCCAAAAC CAGAGGTCTG CGGTTCAAAT CCAAGATGTG 
960 

ACATACAAGA ACATACGTGG GACATCAGCA ACAGCAGCAG CAATTCAACT TAAGTGCAGT 
1020 

GACAGTATGC CCTGCAAAGA TATAAAGCTA AGTGATATAT CTTTGAAGCT TACCTCAGGG 
1080 

AAAATTGCTT CCTGCCTTAA TGATAATGCA AATGGATATT TCAGTGGACA CGTCATCCCT 
1140 

GCATGCAAGA ATTTAAGTCC AAGTGCTAAG CGAAAAGAAT CTAAATCCCA TAAACACCCA 
1200 

AAAACTGTAA TGGTTGAAAA TATGCGAGCA TATGACAAGG GTAACAGAAC ACGCATATTG 
1260 

TTGGGGTCGA GGCCTCCGAA TTGTACAAAC AAATGTCATG GTTGCAGTCC ATGTAAGGCC 
1320 

AAGTTAGTTA TTGTTCATCG TATTATGCCG CAGGAGTATT ATCCTCAGAG GTGGATATGC 
1380 

AGCTGTCATG GCAAAATCTA CCATCCATAA 
1410 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1395 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

TCTCGTCATG ATGCTATCAA CATCTTCAAT GTGGAAAAGT ATGGCGCAGT AGGCGATGGA 
60 



AAGCATGATT GCACTGAGGC ATTTTCAACA GCATGGCAAG CTGCATGCAA AAACCCATCA 
120 

GCAATGTTGC TTGTGCCAGG CAGCAAGAAA TTTGTTGTAA ACAATCTGTT CTTCAATGGG 
180 

CCATGTCAAC CTCACTTTAC TTTTAAGGTA GATGGGATAA TAGCTGCGTA CCAAAATCCA 
240 

GCGAGCTGGA AGAATAATAG AATATGGTTG CAGTTTGCTA AACTTACAGG TTTTACTCTA 
300 

ATGGGTAAAG GTGTAATTGA TGGGCAAGGA AAACAATGGT GGGCTGGCCA ATGTAAATGG 
360 

GTCAATGGAC GAGAAATTTG CAACGATCGT GATAGACCAA CAGCCATTAA ATTCGATTTT 
420 

TCCACGGGTC TGATAATCCA AGGACTGAAA CTAATGAACA GTCCCGAATT TCATTTAGTT 
480 

TTTGGGAATT GTGAGGGAGT AAAAATCATC GGCATTAGTA TTACGGCACC GAGAGACAGT 
540 

CCTAACACTG ATGGAATTGA TATCTTTGCA TCTAAAAACT TTCACTTACA AAAGAACACG 
600 

ATAGGAACAG GGGATGACTG CGTCGCTATA GGCACAGGGT CTTCTAATAT TGTGATTGAG 
660 

GATCTGATTT GCGGTCCAGG CCATGGAATA AGTATAGGAA GTCTTGGGAG GGAAAACTCT 
720 



AGAGCAGAGG TTTCATACGT GCACGTAAAT GGGGCTAAAT TCATAGACAC ACAAAATGGA 
780 

TTAAGAATCA AAACATGGCA GGGTGGTTCA GGCATGGCAA GCCATATAAT TTATGAGAAT 
840 

GTTGAAATGA TAAATTCGGA GAACCCCATA TTAATAAATC AATTCTACTG CACTTCAGCT 
900 

TCTGCTTGCC AAAACCAGAG GTCTGCGGTT CAAATCCAAG ATGTGACATA CAAGAACATA 
960 

CGTGGGACAT CAGCAACAGC AGCAGCAATT CAACTTAAGT GCAGTGACAG TATGCCCTGC 
1020 

AAAGATATAA AGCTAAGTGA TATATCTTTG AAGCTTACCT CAGGGAAAAT TGCTTCCTGC 
1080 

CTTAATGATA ATGCAAATGG ATATTTCAGT GGACACGTCA TCCCTGCATG CAAGAATTTA 
1140 

AGTCCAAGTG CTAAGCGAAA AGAATCTAAA TCCCATAAAC ACCCAAAAAC TGTAATGGTT 
1200 

GAAAATATGC GAGCATATGA CAAGGGTAAC AGAACACGCA TATTGTTGGG GTCGAGGCCT 
1260 

CCGAATTGTA CAAACAAATG TCATGGTTGC AGTCCATGTA AGGCCAAGTT AGTTATTGTT 
1320 

CATCGTATTA TGCCGCAGGA GTATTATCCT CAGAGGTGGA TATGCAGCTG TCATGGCAAA 
1380 

ATCTACCATC CATAA 
1395 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1479 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GAAGATCAAT CTGCCCAAAT TATGTTGGAC AGTGTTGTCG AAAAATATCT TAGATCGAAT 
60 

CGGAGTTTAA GAAAAGTTGA GCATTCTCGT CATGATGCTA TCAACATCTT CAATGTGGAA 
120 

AAGTATGGCG CAGTAGGCGA TGGAAAGCAT GATTGCACTG AGGCATTTTC AACAGCATGG 
180 

CAAGCTGCAT GCAAAAACCC ATCAGCAATG TTGCTTGTGC CAGGCAGCAA GAAATTTGTT 
240 

GTAAACAATC TGTTCTTCAA TGGGCCATGT CAACCTCACT TTACTTTTAA GGTAGATGGG 
300 

ATAATAGCTG CGTACCAAAA TCCAGCGAGC TGGAAGAATA ATAGAATATG GTTGCAGTTT 
360 

GCTAAACTTA CAGGTTTTAC TCTAATGGGT AAAGGTGTAA TTGATGGGCA AGGAAAACAA 
420 

TGGTGGGCTG GCCAATGTAA ATGGGTCAAT GGACGAGAAA TTTGCAACGA TCGTGATAGA 
480 

CCAACAGCCA TTAAATTCGA TTTTTCCACG GGTCTGATAA TCCAAGGACT GAAACTAATG 
540 

AACAGTCCCG AATTTCATTT AGTTTTTGGG AATTGTGAGG GAGTAAAAAT CATCGGCATT 
600 

AGTATTACGG CACCGAGAGA CAGTCCTAAC ACTGATGGAA TTGATATCTT TGCATCTAAA 
660 



AACTTTCACT TACAAAAGAA CACGATAGGA ACAGGGGATG ACTGCGTCGC TATAGGCACA 
720 



GGGTCTTCTA ATATTGTGAT TGAGGATCTG ATTTGCGGTC CAGGCCATGG AATAAGTATA 
780 

GGAAGTCTTG GGAGGGAAAA CTCTAGAGCA GAGGTTTCAT ACGTGCACGT AAATGGGGCT 
840 

AAATTCATAG ACACACAAAA TGGATTAAGA ATCAAAACAT GGCAGGGTGG TTCAGGCATG 
900 

GCAAGCCATA TAATTTATGA GAATGTTGAA ATGATAAATT CGGAGAACCC CATATTAATA 
960 

AATCAATTCT ACTGCACTTC AGCTTCTGCT TGCCAAAACC AGAGGTCTGC GGTTCAAATC 
1020 

CAAGATGTGA CATACAAGAA CATACGTGGG ACATCAGCAA CAGCAGCAGC AATTCAACTT 
1080 



AAGTGCAGTG ACAGTATGCC CTGCAAAGAT ATAAAGCTAA GTGATATATC TTTGAAGCTT 
1140 

ACCTCAGGGA AAATTGCTTC CTGCCTTAAT GATAATGCAA ATGGATATTT CAGTGGACAC 
1200 



GTCATCCCTG CATGCAAGAA TTTAAGTCCA AGTGCTAAGC GAAAAGAATC TAAATCCCAT 
1260 



AAACACCCAA AAACTGTAAT GGTTGAAAAT ATGCGAGCAT ATGACAAGGG TAACAGAACA 
1320 



CGCATATTGT TGGGGTCGAG GCCTCCGAAT TGTACAAACA AATGTCATGG TTGCAGTCCA 
1380 



TGTAAGGCCA AGTTAGTTAT TGTTCATCGT ATTATGCCGC AGGAGTATTA TCCTCAGAGG 
1440 

TGGATATGCA GCTGTCATGG CAAAATCTAC CATCCATAA 
1479 
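
The lengths reported for SEQ ID NOS: 7, 8 and 9 are consistent with 5'-truncated versions of the SEQ ID NO: 1 coding region that retain the stop codon. The sketch below checks this; the starting residues (46, 51 and 23) are inferred here from the first codons of each listed sequence rather than stated in the listing, and the function names are hypothetical.

```python
PROTEIN_LENGTH = 514   # amino acids in SEQ ID NO: 2
CDS_START = 42         # first nucleotide of the CDS in SEQ ID NO: 1


def truncated_cdna_length(first_residue):
    """Nucleotides from the codon of first_residue through the stop codon."""
    return (PROTEIN_LENGTH - first_residue + 1) * 3 + 3


def cdna_position(first_residue):
    """1-based position in SEQ ID NO: 1 of the first base of that codon."""
    return CDS_START + 3 * (first_residue - 1)


# SEQ ID NO -> (inferred first residue, length stated in the listing)
CONSTRUCTS = {7: (46, 1410), 8: (51, 1395), 9: (23, 1479)}
lengths_agree = all(truncated_cdna_length(first) == stated
                    for first, stated in CONSTRUCTS.values())
```

All three stated lengths agree with this calculation; for instance, a sequence beginning at Arg 46 (nucleotide 177 of SEQ ID NO: 1) should contain 469 codons plus the stop codon, or 1410 base pairs.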



(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GGGTCTAGAG GTACCGTCCG TCCGATCGAT CCATT 
35 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AATGATCGAT GCT 
13 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

RTAYTTYTCN ACRTTRAA 
18 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GGGTCTAGAG GTA 
13 



(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Phe Asn Val Glu Lys Tyr 
1 5 
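
Several oligonucleotides in this listing use IUPAC ambiguity codes (R, Y, H, D, N). As an illustration, the sketch below expands those codes for SEQ ID NO: 12 and compares it with the reverse complement of the SEQ ID NO: 1 codons for Phe Asn Val Glu Lys Tyr (SEQ ID NO: 14). That SEQ ID NO: 12 is the antisense oligonucleotide for this hexapeptide is an inference from the sequences themselves, and the function names are hypothetical.

```python
from math import prod

# IUPAC nucleotide codes used in this listing and the bases they stand for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "H": "ACT", "D": "AGT", "N": "ACGT",
}
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def degeneracy(primer):
    """Number of distinct sequences represented by a degenerate primer."""
    return prod(len(IUPAC[base]) for base in primer)


def reverse_complement(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def matches(primer, target):
    """True if target is one of the sequences encoded by the degenerate primer."""
    return len(primer) == len(target) and all(
        t in IUPAC[p] for p, t in zip(primer, target))


SEQ_ID_12 = "RTAYTTYTCNACRTTRAA"
CODING = "TTCAATGTGGAAAAGTAT"  # Phe Asn Val Glu Lys Tyr codons in SEQ ID NO: 1
antisense = reverse_complement(CODING)
```

Here `degeneracy(SEQ_ID_12)` is 128, and `matches(SEQ_ID_12, antisense)` is `True`, so the cDNA actually obtained is one of the 128 sequences the degenerate oligonucleotide represents.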

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CCTGCAGTAY TTYTCNACRT TRAANAT 
27 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



CCTGCAG 
7 



(2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Ile Phe Asn Val Glu Lys Tyr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

CCTGCAGTAY TTYTCNACRT TRAADAT 
27 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GCNATHAAYA THTTYAA 
17 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Ala Ile Asn Ile Phe Asn 
1 5 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GGAATTCCGC NATHAAYATH TTYAAYGT 
28 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



GGAATTCC 
8 



(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Ala Ile Asn Ile Phe Asn Val 
1 5 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

GCYTCNGTRC ARTCRTGYTT 
20 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Lys His Asp Cys Thr Glu Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

GGCTGCAGGT RCARTCRTGY TTNCCRTC 
28 

(2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



GGCTGCAG 
8 



(2) INFORMATION FOR SEQ ID NO: 28: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Asp Gly Lys His Asp Cys Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

ATGTTGGACA GTGTTGTCGA A 
21 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

GGGAATTCAG AAAAGTTGAG CATTCTCGT 
29 

(2) INFORMATION FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

GGGAATTC 
8 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

GTTCTTCAAT GGGCCATGT
19 

(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

GTGTTAGGAC TGTCTCTCGG
20 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

TGTCCAGGCC ATGGAATAAG
20 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GCCTTACATG GACTGCAACC 
20 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

TCCACGGGTC TGATAATCCA 
20 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

AGGCAGGAAG CAATTTTCCC 
20 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

TACTGCACTT CAGCTTCTGC 
20 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

GGGGGTCTCC GAATTTATCA 
20 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

GGATATTTCA GTGGACACGT 
20 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

TATTAGTAGA CCCTGCGCCT
20

(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:

CCATGTAAGG CCAAGTTAGT 
20 

(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

ACACCTTTAC CCATTAGAGT 
20

(2) INFORMATION FOR SEQ ID NO: 44:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:

CTGTCCAACA TAATTTGGGC 
20 



(2) INFORMATION FOR SEQ ID NO: 45:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



CATCGCAGGG TGGTTCAGGC
20

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:



TAGCCCCATT TACGTGCACG 
20 



(2) INFORMATION FOR SEQ ID NO: 47: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47:

TTGGGGTCGA GGCCTCCGAA 
20 

(2) INFORMATION FOR SEQ ID NO:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

TAAAAUGGC 
9 

(2) INFORMATION FOR SEQ ID NO:49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

AACAADGGC 
9 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

GCCGAATTCA TGGCCATGAA ATTAATT 
27 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51:

GCCGAATTC
9



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:

CGGGGATCCT CATTATGGAT GGTAGAT 
27 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



CGGGGATCC 
9 
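
SEQ ID NOS: 27, 31, 51 and 53 each match the 5' end of a longer primer in this listing (SEQ ID NOS: 26, 30, 50 and 52 respectively). The sketch below checks that relationship and reports which common restriction site each short sequence contains. The helper name `split_primer` is hypothetical, and the enzyme names are an assumption based on the standard recognition sequences; the listing does not name any enzyme.

```python
# Recognition sequences of three widely used restriction enzymes.
RESTRICTION_SITES = {"PstI": "CTGCAG", "EcoRI": "GAATTC", "BamHI": "GGATCC"}

# primer SEQ ID NO -> (primer sequence, linker SEQ ID NO, linker sequence),
# transcribed from the listing with the spacing between 10-base groups removed.
PRIMERS = {
    26: ("GGCTGCAGGTRCARTCRTGYTTNCCRTC", 27, "GGCTGCAG"),
    30: ("GGGAATTCAGAAAAGTTGAGCATTCTCGT", 31, "GGGAATTC"),
    50: ("GCCGAATTCATGGCCATGAAATTAATT", 51, "GCCGAATTC"),
    52: ("CGGGGATCCTCATTATGGATGGTAGAT", 53, "CGGGGATCC"),
}


def split_primer(primer, linker):
    """Return (site name or None, remainder of the primer after the linker)."""
    if not primer.startswith(linker):
        raise ValueError("linker is not a 5' prefix of the primer")
    site = next((name for name, seq in RESTRICTION_SITES.items()
                 if seq in linker), None)
    return site, primer[len(linker):]


for primer_id, (primer, linker_id, linker) in PRIMERS.items():
    site, remainder = split_primer(primer, linker)
    print(f"SEQ ID NO: {primer_id} = SEQ ID NO: {linker_id} ({site} site) "
          f"+ {remainder} ({len(remainder)} bases)")
```

Splitting the primers this way separates the added cloning site from the portion that would anneal to the template, which is the part to compare against the nucleotide sequence of Fig. 4.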



(2) INFORMATION FOR SEQ ID NO: 54:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54:

Phe Thr Phe Lys Val Asp Gly Ile Ile Ala Ala Tyr Gln
1 5 10



(2) INFORMATION FOR SEQ ID NO:55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Asn Gly Tyr Phe Ser Gly His Val Ile Pro Ala Cys Lys Asn
1 5 10

Claims:

1. A nucleic acid having a nucleotide sequence coding for a Japanese cedar
pollen allergen Cry j II, or at least one antigenic fragment thereof, or the
functional equivalent of said nucleotide sequence.

2. A nucleic acid of claim 1 wherein said nucleotide sequence consists
essentially of at least one fragment of the coding portion of the nucleotide
sequence of Fig. 4 (SEQ ID NO: 1).

3. A nucleic acid of claim 2 wherein said fragment comprises bases 108 through
1586 (SEQ ID NO: 9) of the nucleotide sequence of Fig. 4 (SEQ ID NO: 1).

4. A nucleic acid of claim 1 wherein said nucleotide sequence consists
essentially of the nucleotide sequence of Fig. 4 (SEQ ID NO: 1).

5. A nucleic acid of claim 1 wherein said fragment comprises bases selected
from the group consisting of bases 177 through 1586 (SEQ ID NO: 7) of the
nucleotide sequence of Fig. 4, and bases 192 through 1586 (SEQ ID NO: 8)
of the nucleotide sequence of Fig. 4 (SEQ ID NO: 1).

6. An expression vector comprising a nucleotide sequence coding for a Japanese
cedar pollen allergen Cry j II, or at least one antigenic fragment thereof, or
the functional equivalent of said nucleotide sequence.

7. An expression vector of claim 6 wherein said nucleotide sequence consists
essentially of at least one fragment of the coding portion of the nucleotide
sequence of Fig. 4 (SEQ ID NO: 1).

8. An expression vector of claim 6 wherein said nucleotide sequence comprises
bases 108 through 1586 (SEQ ID NO: 9) of the nucleotide sequence of Fig. 4.

9. A host cell transformed to express a protein or peptide encoded by the nucleic
acid of claim 1.

10. Isolated Cry j II protein, or at least one antigenic fragment thereof, produced
in a host cell transformed with the nucleic acid of claim 1.

11. An antigenic fragment of claim 10 which does not bind immunoglobulin E
specific for a Japanese cedar pollen allergen, or if binding of said antigenic
fragment to said immunoglobulin E occurs, such binding does not result in
histamine release from mast cells or basophils.

12. An antigenic fragment of claim 10 which binds immunoglobulin E to a
substantially lesser extent than purified, native Cry j II protein binds said
immunoglobulin E.

13. Isolated Cry j II protein of claim 10 wherein the host cell is E. coli.

14. A method of producing Cry j II protein, or at least one fragment thereof,
comprising the steps of:

a. culturing a host cell transformed with a DNA sequence encoding Cry j
II protein or fragment thereof, in an appropriate medium to produce a
mixture of cells and medium containing Cry j II protein or at least one
fragment thereof; and

b. purifying said mixture to produce substantially pure Cry j II protein,
or at least one fragment thereof.

15. A protein preparation comprising Cry j II protein, or at least one fragment
thereof, synthesized in a host cell transformed with a nucleic acid comprising
a nucleotide sequence encoding all or a portion of Cry j II.

16. A protein preparation of claim 15 wherein said at least one fragment of Cry j
II is an antigenic fragment.

17. A protein preparation comprising chemically synthesized Cry j II protein, or
at least one fragment thereof.

18. A protein preparation of claim 15 wherein said Cry j II protein comprises an
amino acid sequence shown in Fig. 4 (SEQ ID NO: 2).

19. A protein preparation of claim 17 wherein said Cry j II protein comprises an
amino acid sequence shown in Fig. 4 (SEQ ID NO: 2).

20. An isolated peptide comprising at least one T cell epitope of Cry j II.

21. An isolated peptide of claim 20 which has minimal immunoglobulin E
stimulating activity.

22. An isolated peptide of claim 20 which does not bind immunoglobulin E
specific for a Japanese cedar pollen allergen, or if binding of the peptide to
said immunoglobulin E occurs, such binding does not result in histamine
release from mast cells or basophils.

23. An isolated peptide of claim 20 which binds immunoglobulin E to a
substantially lesser extent than purified native Cry j II protein binds said
immunoglobulin E.

24. Isolated Cry j II protein, or an antigenic fragment thereof, which modifies, in
an individual sensitive to Japanese cedar pollen to whom it is administered,
the allergic response of the individual to a Japanese cedar pollen allergen.

25. Isolated Cry j II protein or antigenic fragment of claim 24 which modifies
B-cell response of the individual to a Japanese cedar pollen allergen, T-cell
response of the individual to a Japanese cedar pollen allergen, or both the
B-cell response and the T-cell response of the individual to a Japanese cedar
pollen allergen.

26. Modified Cry j II protein or at least one modified fragment thereof, which
when administered to an individual sensitive to Japanese cedar pollen,
reduces the allergic response of the individual to Cry j II.

27. A therapeutic composition comprising isolated Cry j II protein, or at least one
fragment thereof, and a pharmaceutically acceptable carrier or diluent.

28. A therapeutic composition of claim 27 wherein said Cry j II protein
comprises an amino acid sequence shown in Fig. 4 (SEQ ID NO: 2).

29. A method of treating sensitivity to a Japanese cedar pollen allergen, or an
allergen immunologically cross-reactive with a Japanese cedar pollen
allergen, in an individual sensitive to said allergen, comprising administering
to the individual a therapeutically effective amount of the composition of
claim 27.

30. A method of detecting sensitivity in an individual to a Japanese cedar pollen
allergen, comprising combining a blood sample obtained from the individual
with isolated Cry j II protein, or antigenic fragment thereof, produced in a
host cell transformed with the nucleic acid of claim 1 or chemically
synthesized, under conditions appropriate for binding of blood components
with the protein or fragment thereof, and determining the extent to which
such binding occurs.

31. A method of claim 30 wherein the extent to which binding occurs is
determined by assessing T cell function, T cell proliferation, B cell function,
binding of the protein or fragment thereof to antibodies present in the blood
or a combination thereof.

32. A monoclonal antibody, polyclonal antibody or immunoreactive fragment
thereof, specifically reactive with Cry j II protein, or at least one antigenic
fragment thereof.

33. Cry j II protein isolated from Japanese cedar pollen, said protein having a
molecular weight of about 40 kD as determined by sodium dodecyl sulfate-
polyacrylamide gel electrophoresis.

34. A host cell transformed with a vector containing the cDNA insert of Cry j II,
said host cell having ATCC deposit number 69105.

35. A recombinant DNA molecule comprising a DNA coding for a polypeptide
having at least one epitope of the protein allergen, Cry j II.
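
The fragments recited in claims 3, 5 and 8 are defined by base ranges of the nucleotide sequence of Fig. 4 (SEQ ID NO: 1). As a small worked example, the Python sketch below computes the length of each range and shows how such a range could be sliced from a sequence string. It assumes the ranges are 1-based and inclusive, which is the usual reading of "bases 108 through 1586" but is not stated explicitly; the helper names are hypothetical.

```python
# Base ranges recited in the claims, keyed by the SEQ ID NO given for each.
FRAGMENTS = {
    "SEQ ID NO: 9": (108, 1586),
    "SEQ ID NO: 7": (177, 1586),
    "SEQ ID NO: 8": (192, 1586),
}


def fragment_length(first, last):
    """Number of bases in a 1-based, inclusive range."""
    return last - first + 1


def slice_bases(sequence, first, last):
    """Extract a 1-based, inclusive base range from a sequence string."""
    return sequence[first - 1:last]


for name, (first, last) in FRAGMENTS.items():
    n = fragment_length(first, last)
    print(f"{name}: bases {first}-{last}, {n} bases, "
          f"{n // 3} codons, remainder {n % 3}")
```

Under that reading the three ranges are 1479, 1410 and 1395 bases long. Each length is a multiple of three, which is consistent with every fragment spanning a whole number of codons.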

M A M K L I



70 80 9C iOO ilO 120 

I I I ! ' ^ j. 

Cvcc kJ- TGGCCTTTCTCGCC ATGCAATTGATTATAATGGCGGCAGCAGAAG ATCA-rtTCTG 

ap'kaflamqliimaaaedqs 

10 20 

130 140 150 160 170 180 

i i t I I . ' 

CCCAAATTATCTTGGACAGTCTTGTGGAAAAATATCTTAGATCGAATCGGAGTTTAAGAA 

AQIMLDSVVEK YLRSNRSLR 
30 40 

190 200 210 220 230 240 

I I I I • ' 

AAGTTGAGCATTCTCGTCATGATGCTATCAACATCTTCAATGTGGAAAAGTATGGCGCAG 

KV E HSRHDA INIFNVEK YGA 
50 60 

250 260 270 280 290 300 

I I I I I I 

TAGGCGATGGAAAGCATGATTGCACTGAGGCATTTTCAACAGCATGGCAAGCTGCATGCA 

VGDGKHDCTEAFSTAWQAAC 
70 80 

310 320 330 340 350 360 

I I I I I < 

AAAACCCATCAGCAATGTTGCTTGTGCCAGGCAGCAAGAAATTTGTTGTAAACAATCTGT 

KNPSAM LLVPG SKKFVVNNL 
90 100 • 

370 380 390 400 410 420 

I I I I I • 

TCTTCAATCGGCCATGTCAACCTCJVCTTTACTTTTAAGGTAGATGGGATAATAGCTGCGT 

F?NG P C Q PH FTFK VDG I I AA 
110 120 

430 440 450 460 470 480 

I I I I I I 

ACCAAAATCCAGCGAGCTGGAAGAATAATAGAATATGGTTGCAGTTTGCTAAACTTACAG 

Y QN P A S W KNNR IW L Q F A K L T 

130 140 

1090 1100 1110 1120 1130 1140

GCACTTCAGCTTCTCCTTCCCAAAACCAGAGGTCTGCGGTTCAAATCCAAGATC^^ 
CTSASACQNQRSAVQIQDVT 

350 

1150 1160 1170 1180 1190 1200 ' 

ACAAGAACAkcGTCGGAciTCAGCTACAGCAGCAGCAATTCAACl^^ 
YKNI RG TS ATAAAIQL KC S u 
370 380 

1210 1220 1230 1240 1250 1260 

GTATGCCCTGCAAAGATATAAAGCTAAGTGATATATCTTTGAAGCTTACCTC^^ 
SM PC KD IKLSDISLKLTSGK 

390 

1270 1280 ■ 1290 1300 1310 1320 

I I I I ■ ' 

TTGCTTCCTGCCTTAATGATAATCCAAATGGATATTTCAGTGGACACGTCATCCCTGCAT 

lASCLNDNANGYFSGHVI PA 
410 " 420 

1330 1340. 1350 1360 1370 1380 

I I I I ' ' 

GCAAGAATTTAAGTCCAAGTGCTAAGCGAAAAGAATCTAAATCCCATAAACACCCAAAAA 

CKNLS PSAKRKESKSHKHPK 
430 . 440 

1390 1400 1410 1420 1430 1440 

I I I I • ' 

CTGTajiTGGTTGAAAATATGCGAGCATATGACAAGGGTAACAGAACACGCATATTGTTGG 

TVMVENMRAYDKGNRTRILL 
450 460 



WO 94/11512    6/18    PCT/US93/11000




1510 1520 1530 1540 1550 156Q 

'"agtt;.ttg'1tcatcgta4atcccgcag(Ugtattatcctcagaggtggat^^^^ 

L V I V H R I M P 0 E Y Y P Q R ^ ^ - 

490 

1570 1580 1590 1600 1610 1620 

TTCTTCTGGiACAATATTAGAACTCATATTGAAAATAAATCATC^

1690 1700 1710 1720 

I I ■ I 

TTATAATAGATTATATTAATGGTrCAGCCTGGTGCAAAAAAAAAAA-3 ' 

Cry j II    RKVEHSRHDAINIFNVEKYGA
Long        RKVEHSRHDAINIFNVEKYGA
Short            SRHDAINIFNVEKYGA
Sakaguchi            AINIFNVEKY

            80 90
Cry j II    VGDGKHDCTEAFSTAWQAACKNPS
Long        VGDGKHDCTEAFSTAW(Q )KNP( )
Short       VGDGKHDCTEAFSTAW(Q )KNP( )